95 Human Secreted Proteins
Field of the Invention 

This invention relates to newly identified polynucleotides and the 
polypeptides encoded by these polynucleotides, uses of such polynucleotides and 
polypeptides, and their production.

Background of the Invention 
Unlike bacteria, which exist as a single compartment surrounded by a
membrane, human cells and other eukaryotes are subdivided by membranes into many
functionally distinct compartments. Each membrane-bounded compartment, or
organelle, contains different proteins essential for the function of the organelle. The
cell uses "sorting signals," which are amino acid motifs located within the protein, to
target proteins to particular cellular organelles.

One type of sorting signal, called a signal sequence, a signal peptide, or a
leader sequence, directs a class of proteins to an organelle called the endoplasmic
reticulum (ER). The ER separates the membrane-bounded proteins from all other
types of proteins. Once localized to the ER, both groups of proteins can be further
directed to another organelle called the Golgi apparatus. Here, the Golgi distributes
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomes,
and the other organelles.
Proteins targeted to the ER by a signal sequence can be released into the
extracellular space as a secreted protein. For example, vesicles containing secreted
proteins can fuse with the cell membrane and release their contents into the
extracellular space - a process called exocytosis. Exocytosis can occur constitutively
or after receipt of a triggering signal. In the latter case, the proteins are stored in
secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly,
proteins residing on the cell membrane can also be secreted into the extracellular
space by proteolytic cleavage of a "linker" holding the protein to the membrane.

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins 
include the commercially valuable human insulin, interferon, Factor VIII, human



WO 99/47540 PCT/US99/05804 


growth hormone, tissue plasminogen activator, and erythropoietin. Thus, in light of
the pervasive role of secreted proteins in human physiology, a need exists for
identifying and characterizing novel human secreted proteins and the genes that
encode them. This knowledge will allow one to detect, to treat, and to prevent
medical disorders by using secreted proteins or the genes that encode them.



Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded
polypeptides. Moreover, the present invention relates to vectors, host cells,
antibodies, and recombinant methods for producing the polypeptides and
polynucleotides. Also provided are diagnostic methods for detecting disorders related
to the polypeptides, and therapeutic methods for treating such disorders. The
invention further relates to screening methods for identifying binding partners of the
polypeptides.

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original
environment (e.g., the natural environment if it is naturally occurring), and thus is
altered "by the hand of man" from its natural state. For example, an isolated
polynucleotide could be part of a vector or a composition of matter, or could be
contained within a cell, and still be "isolated" because that vector, composition of
matter, or particular cell is not the original environment of the polynucleotide.

In the present invention, a "secreted" protein refers to those proteins capable
of being directed to the ER, secretory vesicles, or the extracellular space as a result of
a signal sequence, as well as those proteins released into the extracellular space
without necessarily containing a signal sequence. If the secreted protein is released
into the extracellular space, the secreted protein can undergo extracellular processing
to produce a "mature" protein. Release into the extracellular space can occur by many
mechanisms, including exocytosis and proteolytic cleavage.

In specific embodiments, the polynucleotides of the invention are less than
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further
embodiment, polynucleotides of the invention comprise at least 15 contiguous
nucleotides of the coding sequence, but do not comprise all or a portion of any intron.
In another embodiment, the nucleic acid comprising the coding sequence does not
contain coding sequences of a genomic flanking gene (i.e., 5' or 3' to the gene in the
genome).

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid
sequence contained in SEQ ID NO:X or the cDNA contained within the clone
deposited with the ATCC. For example, the polynucleotide can contain the
nucleotide sequence of the full length cDNA sequence, including the 5' and 3'
untranslated sequences, the coding region, with or without the signal sequence, the
secreted protein coding region, as well as fragments, epitopes, domains, and variants
of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a
molecule having the translated amino acid sequence generated from the
polynucleotide as broadly defined.

In the present invention, the full length sequence identified as SEQ ID NO:X
was often generated by overlapping sequences contained in multiple clones (contig
analysis). A representative clone containing all or most of the sequence for SEQ ID
NO:X was deposited with the American Type Culture Collection ("ATCC"). As
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard,
Manassas, Virginia 20110-2209, USA. The ATCC deposit was made pursuant to the
terms of the Budapest Treaty on the international recognition of the deposit of
microorganisms for purposes of patent procedure.

A "polynucleotide" of the present invention also includes those 
polynucleotides capable of hybridizing, under stringent hybridization conditions, to
sequences contained in SEQ ID NO:X, the complement thereof, or the cDNA within
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an
overnight incubation at 42°C in a solution comprising 50% formamide, 5x SSC (750
mM NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's
solution, 10% dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA,
followed by washing the filters in 0.1x SSC at about 65°C.
Also contemplated are nucleic acid molecules that hybridize to the
polynucleotides of the present invention at lower stringency hybridization conditions.
Changes in the stringency of hybridization and signal detection are primarily
accomplished through the manipulation of formamide concentration (lower
percentages of formamide result in lowered stringency), salt conditions, or
temperature. For example, lower stringency conditions include an overnight
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M
NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 µg/ml salmon
sperm blocking DNA; followed by washes at 50°C with 1X SSPE, 0.1% SDS. In
addition, to achieve even lower stringency, washes performed following stringent
hybridization can be done at higher salt concentrations (e.g., 5X SSC).
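
The buffer strengths quoted above are multiples of a stock recipe, so the working concentrations follow by simple scaling. The following is a minimal Python sketch of that arithmetic. It assumes only the compositions stated in the text (5x SSC = 750 mM NaCl and 75 mM sodium citrate; 20X SSPE = 3M NaCl, 0.2M NaH2PO4, 0.02M EDTA). The names SSC_1X_MM, SSPE_1X_MM, and scale_buffer are illustrative and are not part of this specification.

```python
# Component concentrations (mM) of the 1x buffers, derived from the
# compositions quoted in the text (5x SSC and 20X SSPE).
SSC_1X_MM = {"NaCl": 750 / 5, "sodium citrate": 75 / 5}
SSPE_1X_MM = {"NaCl": 3000 / 20, "NaH2PO4": 200 / 20, "EDTA": 20 / 20}


def scale_buffer(one_x_mm, strength):
    """Return component concentrations (mM) of a buffer at `strength` x."""
    if strength <= 0:
        raise ValueError("strength must be positive")
    return {name: conc * strength for name, conc in one_x_mm.items()}


if __name__ == "__main__":
    # Compare the stringent wash with the lower-stringency steps.
    for label, recipe, strength in [
        ("0.1x SSC wash", SSC_1X_MM, 0.1),
        ("5X SSC wash", SSC_1X_MM, 5),
        ("6X SSPE hybridization", SSPE_1X_MM, 6),
        ("1X SSPE wash", SSPE_1X_MM, 1),
    ]:
        print(label, scale_buffer(recipe, strength))
```

Under these assumptions the stringent 0.1x SSC wash contains roughly 15 mM NaCl, whereas a 5X SSC wash contains 750 mM NaCl, which illustrates why the higher-salt wash is the less stringent of the two.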

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid
molecule containing a poly(A) stretch or the complement thereof (e.g., practically
any double-stranded cDNA clone).

The polynucleotide of the present invention can be composed of any
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or
DNA or modified RNA or DNA. For example, polynucleotides can be composed of
single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a mixture of
single- and double-stranded regions, hybrid molecules comprising DNA and RNA
that may be single-stranded or, more typically, double-stranded or a mixture of
single- and double-stranded regions. In addition, the polynucleotide can be composed of
triple-stranded regions comprising RNA or DNA or both RNA and DNA. A
polynucleotide may also contain one or more modified bases or DNA or RNA
backbones modified for stability or for other reasons. "Modified" bases include, for
example, tritylated bases and unusual bases such as inosine. A variety of
modifications can be made to DNA and RNA; thus, "polynucleotide" embraces
chemically, enzymatically, or metabolically modified forms.

The polypeptide of the present invention can be composed of amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres, and may contain amino acids other than the 20 gene-encoded amino acids. 
The polypeptides may be modified by either natural processes, such as
posttranslational processing, or by chemical modification techniques which are well
known in the art. Such modifications are well described in basic texts and in more
detailed monographs, as well as in a voluminous research literature. Modifications
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid
side-chains and the amino or carboxyl termini. It will be appreciated that the same
type of modification may be present in the same or varying degrees at several sites in
a given polypeptide. Also, a given polypeptide may contain many types of
modifications. Polypeptides may be branched, for example, as a result of
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched,
and branched cyclic polypeptides may result from posttranslational natural processes or
may be made by synthetic methods. Modifications include acetylation, acylation,
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent
attachment of a lipid or lipid derivative, covalent attachment of phosphatidylinositol,
cross-linking, cyclization, disulfide bond formation, demethylation, formation of
covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation,
gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation,
iodination, methylation, myristoylation, oxidation, pegylation, proteolytic processing,
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA
mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES,
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993);
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C.
Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth
Enzymol 182:626-646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).)
"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO:Y"
refers to a polypeptide sequence, both sequences identified by an integer specified in
Table 1.

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the
present invention, including mature forms, as measured in a particular biological
assay, with or without dose dependency. In the case where dose dependency does
exist, it need not be identical to that of the polypeptide, but rather substantially similar
to the dose-dependence in a given activity as compared to the polypeptide of the
present invention (i.e., the candidate polypeptide will exhibit greater activity or not
more than about 25-fold less and, preferably, not more than about tenfold less
activity, and most preferably, not more than about three-fold less activity relative to
the polypeptide of the present invention).
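
The fold-difference limits in this definition can be read as a simple tiering rule. The sketch below is one possible reading, not a prescribed assay. It assumes both activities are positive values measured in the same assay and units, and it treats the "about" limits (25-fold, tenfold, three-fold) as hard cutoffs for illustration only. The function name activity_tier and the tier labels are hypothetical.

```python
def activity_tier(candidate, reference):
    """Classify a candidate's activity relative to the reference polypeptide.

    Both values are assumed to be positive and measured in the same assay.
    """
    if candidate <= 0 or reference <= 0:
        raise ValueError("activities must be positive")
    fold_less = reference / candidate  # > 1 means the candidate is weaker
    if fold_less <= 3:
        return "most preferred"        # greater activity, or <= 3-fold less
    if fold_less <= 10:
        return "preferred"             # <= tenfold less
    if fold_less <= 25:
        return "within definition"     # <= 25-fold less
    return "outside definition"


if __name__ == "__main__":
    for candidate in (200, 50, 12.5, 5, 1):
        print(candidate, activity_tier(candidate, reference=100))
```

A candidate with greater activity than the reference has a fold difference below one and therefore falls in the first tier, consistent with the phrase "will exhibit greater activity or not more than about 25-fold less."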






Polynucleotides and Polypeptides of the Invention 



FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in anergic T cells and Merkel cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune disorders and inflammatory diseases. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the immune system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily fluids
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from
an individual not having the disorder.
Preferred epitopes include those comprising a sequence shown in SEQ ID
NO:108 as residues: Ala-55 to Gln-64.

The tissue distribution in T-cells and Merkel cells indicates that the protein
products of this gene are useful for the diagnosis and/or treatment of immune system
diseases. Furthermore, expression of this gene product in T-cells indicates a role in
the regulation of the proliferation, survival, differentiation, and/or activation of
potentially all hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes that may also suggest a usefulness in the treatment
of cancer (e.g., by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:11 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 and 2329 of SEQ ID NO:11, b is an
integer of 15 to 2343, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO:11, and where b is greater than or equal to a + 14.
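
The a-b formula above defines a family of subsequences at least 15 nucleotides long. The following Python sketch shows one way to test whether a pair of positions satisfies the formula and to count how many pairs it admits. It assumes positions are 1-based and inclusive, that the stated bounds are inclusive, and that the upper limit of a is the sequence length minus 14 (which matches 2329 and 2343 here). The function names are illustrative only.

```python
def is_excluded_fragment(a, b, seq_len):
    """True if positions a..b (1-based, inclusive) satisfy the a-b formula."""
    return 1 <= a <= seq_len - 14 and 15 <= b <= seq_len and b >= a + 14


def count_fragments(seq_len):
    """Number of (a, b) pairs admitted by the formula for a given length."""
    n = seq_len - 14  # largest permitted value of a
    # For each a there are (n + 1 - a) choices of b, which sums to n(n+1)/2.
    return n * (n + 1) // 2 if n > 0 else 0


if __name__ == "__main__":
    length = 2343  # length implied for SEQ ID NO:11 by the bounds above
    print(is_excluded_fragment(1, 15, length))        # shortest fragment at the 5' end
    print(is_excluded_fragment(2329, 2343, length))   # shortest fragment at the 3' end
    print(count_fragments(length))
```

Because b must be at least a + 14, the shortest fragment covered by the formula spans 15 nucleotide positions.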

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASV
SSTRKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELPVVHTFSFL
TFVFTYRCSAGDGSITQINCAYEMGEEMPKRQMKAIKFLLFHFYL (SEQ ID
NO:205), IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASVSST (SEQ ID
NO:206), RKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELP (SEQ
ID NO:207), and/or VVHTFSFLTFVFTYRCSAGDGSITQINCAYEMGEEMPKRQ
MKAIKFLLFHFYL (SEQ ID NO:208). Polynucleotides encoding these polypeptides
are also encompassed by the invention.
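
As printed, the three shorter sequences (SEQ ID NO:206 to 208) read as consecutive segments of SEQ ID NO:205. The sketch below shows how such a relationship could be checked. It assumes that any spaces and line breaks inside the printed sequences are typesetting artifacts rather than part of the sequences. The helper names clean and tiles are hypothetical.

```python
def clean(seq):
    """Drop whitespace introduced by line wrapping in the printed text."""
    return "".join(seq.split())


def tiles(full, parts):
    """True if `parts`, joined in order, reproduce `full` exactly."""
    return "".join(parts) == full


# Sequences transcribed from the paragraph above.
SEQ_205 = clean("""
    IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASV
    SSTRKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELPVVHTFSFL
    TFVFTYRCSAGDGSITQINCAYEMGEEMPKRQMKAIKFLLFHFYL
""")
SEQ_206 = clean("IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASVSST")
SEQ_207 = clean("RKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELP")
SEQ_208 = clean("VVHTFSFLTFVFTYRCSAGDGSITQINCAYEMGEEMPKRQ MKAIKFLLFHFYL")

if __name__ == "__main__":
    print(tiles(SEQ_205, [SEQ_206, SEQ_207, SEQ_208]))
```

The same check could be applied to any other set of sequences in this specification where a longer sequence is followed by shorter segments.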

This gene is expressed primarily in placental, brain and breast tissues, and to a 
lesser extent in T cells and tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, neurodegenerative and/or endocrine disorders and neoplasias, or
developmental disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the neurodegenerative, developing, endocrine and
immune systems, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues or cell types (e.g., brain, endocrine, immune,
developing, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID
NO:109 as residues: Ala-55 to Asn-60, Lys-65 to Met-71, Leu-75 to Asn-86, Asp-93 to
Asp-110, Leu-130 to Cys-138, Gln-149 to Glu-154, Thr-172 to Ile-179, Glu-185 to
Arg-192.
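
Residue ranges written in this "Xaa-N to Yaa-M" form can be parsed mechanically, for example to compute epitope lengths. The sketch below is an illustration only. It assumes three-letter residue codes and reads the printed "Asp-1 10" as Asp-110, on the assumption that the space is a typesetting artifact. The names parse_epitope_ranges and range_lengths are hypothetical.

```python
import re

# Matches ranges such as "Ala-55 to Asn-60" (three-letter residue codes).
_RANGE = re.compile(r"([A-Z][a-z]{2})-(\d+)\s+to\s+([A-Z][a-z]{2})-(\d+)")

# Ranges listed for SEQ ID NO:109, transcribed from the text above.
EPITOPES_109 = (
    "Ala-55 to Asn-60, Lys-65 to Met-71, Leu-75 to Asn-86, Asp-93 to "
    "Asp-110, Leu-130 to Cys-138, Gln-149 to Glu-154, Thr-172 to Ile-179, "
    "Glu-185 to Arg-192."
)


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) tuples."""
    return [(m[1], int(m[2]), m[3], int(m[4])) for m in _RANGE.finditer(text)]


def range_lengths(ranges):
    """Length in residues of each range, counting both endpoints."""
    return [end - start + 1 for _, start, _, end in ranges]


if __name__ == "__main__":
    ranges = parse_epitope_ranges(EPITOPES_109)
    for item, length in zip(ranges, range_lengths(ranges)):
        print(item, length)
```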

The tissue distribution in breast and brain tissues indicates that the protein
products of this gene are useful for the diagnosis and/or treatment of endocrine
disorders, neurodegenerative disorders, developmental disorders, immune system
diseases and neoplasias. The tissue distribution in placental tissue indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis and/or treatment of disorders of the placenta. Specific expression within the
placenta indicates that this gene product may play a role in the proper establishment
and maintenance of placental function. Alternately, this gene product may be
produced by the placenta and then transported to the embryo, where it may play a
crucial role in the development and/or survival of the developing embryo or fetus.

Expression of this gene product in a vascular-rich tissue such as the placenta
also indicates that this gene product may be produced more generally in endothelial
cells or within the circulation. In such instances, it may play more generalized roles in
vascular function, such as in angiogenesis. It may also be produced in the vasculature
and have effects on other cells within the circulation, such as hematopoietic cells. It
may serve to promote the proliferation, survival, activation, and/or differentiation of
hematopoietic cells, as well as other cells throughout the body. Likewise,
expression of this gene product in T-cells indicates a role in the regulation of
the proliferation, survival, differentiation, and/or activation of potentially all
hematopoietic cell lineages, including blood stem cells. This gene product may be
involved in the regulation of cytokine production, antigen presentation, or other
processes that may also suggest a usefulness in the treatment of cancer (e.g., by
boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Alternatively, the tissue distribution in brain tissue indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
detection/treatment of neurodegenerative disease states and behavioural disorders
such as Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette
Syndrome, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder,
panic disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors,
including disorders in feeding, sleep patterns, balance, and perception. In addition, the
gene or gene product may also play a role in the treatment and/or detection of
developmental disorders associated with the developing embryo, or sexually-linked
disorders. Protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:12 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 and 1163 of SEQ ID NO:12, b is an
integer of 15 to 1177, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO:12, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

The translation product of this gene shares sequence homology with bovine 
beta-mannosidase, which is thought to be important in lysosomal catabolism of 
glycoproteins. See, for example, J. Biol. Chem. 270, 3841-3848 (1995), incorporated 
herein by reference in its entirety. Based on the sequence similarity between these
proteins, the translation product of this gene will sometimes hereinafter be referred to
as human beta-mannosidase. Human beta-mannosidase is expected to share certain
biological activities, particularly enzymatic activities, with bovine beta-mannosidase.
Such activities may be assayed by methods known in the art, described in J. Biol.
Chem. 270, 3841-3848 (1995), and/or disclosed elsewhere herein.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HPSIIIWSGNNENEEALMMNWYHISFTDRPIYIKDYVTL 
YVKNIRELVLAGDKSRPFITSSPTNGAETVAEAWVSQNPNSNYFGDVHFYDYI 
SDCWNWKVFPKARFASEYGYQSWPSFSTLEKVSSTEDWSFNSKFSLHRQHH 
EGGNKQMLYQAGLHFKLPQSTDPLRTFKDTIYLTQVMQAQCVKTETEFYRRS 
RSEIVDQQGHTMGALYWQLNDIWQAPSW (SEQ ID NO:209), and/or VRVHTWS
SLEPVCSRVTERFVMKGGEAVCLYEEPVSELLRRCGNCTRESCVVSFYLSAD
HELLSPTNYHFLSSPKEAVGLCKAQITAIISQQGDIFVFDLETSAVAPFVWLDV 
GSIPGRFSDNGFLMTEKTRTILFYPWEPTSKNELEQSFHVTSLTDIY (SEQ ID 
NO:210). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The gene encoding the disclosed cDNA is thought to reside on 
chromosome 4. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 4. 

This gene is expressed primarily in colon tissue, and to a lesser extent in 
thymus stromal cells and chondrosarcoma tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, chondroma and mannosidosis. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the chondro and immune systems,
expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues or cell types (e.g., immune, metabolic, cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid
and spinal fluid) or another tissue or cell sample taken from an individual having such
a disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution and homology to bovine beta-mannosidase indicate
that the protein products of this gene are useful for the diagnosis and/or treatment of
chondroma and mannosidosis. Human beta-mannosidosis is an autosomal recessive,
lysosomal storage disease caused by a deficiency of the enzyme beta-mannosidase.
Furthermore, the homology of the translation product of this gene to
beta-mannosidase indicates that polynucleotides and polypeptides corresponding to this
gene are useful for the diagnosis, prevention, and/or treatment of various metabolic
disorders such as lysosomal storage deficiencies, Tay-Sachs disease,
phenylketonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome.
Protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:13 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 and 2093 of SEQ ID NO:13, b is an
integer of 15 to 2107, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO:13, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 
In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: PRLTPRMKWPTAALASRLLGWTVLRPPYPRVPSLPQVT
LHPTDGLMAVLYTGGEGRTLGEQHFFHETFVTRWLLGPVPVRFGACSPLSFL
APRRGQGAPAGXFCACPRPASRQLCPWPALPGTPYSNSAPLCTGMGHSNTPQ
GPPSPQYALSPTEPTSLSGNSHLPAILVL (SEQ ID NO:211),
PRLTPRMKWPTAALASRLLGWTVLRPPYPRVPSLPQVTLHP (SEQ ID
NO:212), TDGLMAVLYTGGEGRTLGEQHFFHETFVTRWLLGPVPVRFG (SEQ
ID NO:213), ACSPLSFLAPRRGQGAPAGXFCACPRPASRQLCPWPALPGTP
(SEQ ID NO:214), and/or
YSNSAPLCTGMGHSNTPQGPPSPQYALSPTEPTSLSGNSHLPAILVL (SEQ ID
NO:215). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in human lung (adult and fetal), and to a 
lesser extent in liver and brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, pulmonary disorders and hemostasis. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the lung and liver tissues,
expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues or cell types (e.g., pulmonary, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal
fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID
NO:111 as residues: Arg-28 to Gln-36.

The tissue distribution in lung and liver tissues indicates that the protein
products of this gene are useful for the diagnosis and/or treatment of pulmonary
disorders and hematopoietic disorders. The tissue distribution in adult and fetal lung
tissues indicates that polynucleotides and polypeptides corresponding to this gene are
useful for the detection and treatment of disorders associated with developing lungs,
particularly in premature infants where the lungs are the last tissues to develop. The
tissue distribution indicates that polynucleotides and polypeptides corresponding to
this gene are useful for the diagnosis and intervention of lung tumors, since the gene
may be involved in the regulation of cell division, particularly since it is expressed in
fetal tissue. Alternatively, expression of this gene product in liver tissue indicates a
role in the regulation of the proliferation, survival, differentiation, and/or activation
of potentially all hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes that may also suggest a usefulness in the treatment
of cancer (e.g., by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO: 14 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 to 1248 of SEQ ID NO: 14, b is an
integer of 15 to 1262, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO: 14, and where b is greater than or equal to a + 14.
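
The general formula a-b can be illustrated with a short Python sketch. It is not part of the specification. The function names `is_formula_fragment` and `count_formula_fragments` are hypothetical, and the sketch assumes that positions are 1-based and that both stated ranges are inclusive. The constraint that b is at least a + 14 means every described fragment spans at least 15 nucleotides.

```python
def is_formula_fragment(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if positions a..b satisfy the general formula a-b.

    Assumptions (not stated in the source): 1-based, inclusive positions.
    seq_len is the upper bound given for b (e.g., 1262 for SEQ ID NO: 14).
    """
    max_a = seq_len - (min_len - 1)  # e.g., 1262 - 14 = 1248
    return (
        1 <= a <= max_a
        and min_len <= b <= seq_len
        and b >= a + (min_len - 1)
    )


def count_formula_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count all (a, b) pairs that satisfy the general formula."""
    max_a = seq_len - (min_len - 1)
    # For a fixed a, b runs from a + 14 up to seq_len inclusive.
    return sum(seq_len - (a + min_len - 1) + 1 for a in range(1, max_a + 1))


if __name__ == "__main__":
    # Bounds stated for SEQ ID NO: 14: a in 1..1248, b in 15..1262.
    print(is_formula_fragment(1, 15, 1262))
    print(is_formula_fragment(1249, 1262, 1262))
    print(count_formula_fragments(1262))
```

Under these assumptions the formula for SEQ ID NO: 14 covers several hundred thousand distinct fragments, which is consistent with the statement that listing every related sequence is cumbersome.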

FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HLLEVTPCRLPVPEFPGRTPRGSRTPD (SEQ ID NO:216).
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in rapidly dividing liver tissue (e.g.,
hepatoma, hepatocellular carcinoma, and fetal liver tissue), and to a lesser extent in
normal liver tissue and other tumors such as colon cancer and uterine cancer.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, particularly hepatomas, colon cancer, and uterine cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
liver, colon and uterus, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues or cell types (e.g., liver, colon, uterus, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
112 as residues: Trp-35 to Trp-45, Pro-52 to Asp-57, Thr-73 to Arg-82, Pro-105 to
Leu-112, Pro-115 to Arg-127, Pro-140 to Gln-151.

The tissue distribution in liver tissues and cancers thereof, as well as other 
cancerous tissues, indicates that the protein products of this gene are useful for the 
diagnosis and/or treatment of cancers, particularly, hepatoma, colon cancer and 
uterine cancer, as well as cancers of other tissues where expression has been 
observed. Furthermore, expression within cellular sources marked by proliferating 
cells indicates that this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 15 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 745 of SEQ ID NO: 15, b is an 
integer of 15 to 759, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 15, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This gene is expressed primarily in hepatocellular tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hepatomas. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues or cell types (e.g., liver, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
113 as residues: Pro-32 to Gly-40.

The tissue distribution in hepatocellular tumors indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of hepatomas, as 
well as cancers of other tissues where expression has been observed. Furthermore, 
expression within cellular sources marked by proliferating cells indicates that this 
protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Thus, this 
protein may also be involved in apoptosis or tissue differentiation and could again be 
useful in cancer therapy. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 16 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1796 of SEQ ID NO: 16, b is an 
integer of 15 to 1810, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 16, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7

This gene is expressed primarily in human rhabdomyosarcoma tissue, as well 
as in placental tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, malignant neoplasms and reproductive disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the skeletal system
and reproductive system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues or cell types (e.g., reproductive,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
114 as residues: Arg-23 to Trp-28, Phe-93 to Lys-98, Arg-199 to Trp-206, Gly-208 to
Met-213.

The tissue distribution in placental tissue and human rhabdomyosarcoma
tissue indicates that the protein products of this gene are useful for the diagnosis
and/or treatment of skeletal and reproductive disorders. Furthermore, the tissue
distribution in placental tissue indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and/or treatment of disorders
of the placenta. Specific expression within the placenta indicates that this gene
product may play a role in the proper establishment and maintenance of placental
function. Alternatively, this gene product may be produced by the placenta and then
transported to the embryo, where it may play a crucial role in the development and/or
survival of the developing embryo or fetus.

Expression of this gene product in a vascular-rich tissue such as the placenta
also indicates that this gene product may be produced more generally in endothelial
cells or within the circulation. In such instances, it may play more generalized roles in
vascular function, such as in angiogenesis. It may also be produced in the vasculature
and have effects on other cells within the circulation, such as hematopoietic cells. It
may serve to promote the proliferation, survival, activation, and/or differentiation of
hematopoietic cells, as well as other cells throughout the body. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO: 17 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 to 1038 of SEQ ID NO: 17, b is an
integer of 15 to 1052, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO: 17, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

This gene is expressed primarily in fetal liver/spleen and fetal skin tissues, and 
to a lesser extent in breast cancer tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, developmental disorders and neoplasias. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the fetal tissue and adult
immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues or cell types (e.g., developing, immune, cancerous
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial
fluid and spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in fetal liver/spleen and skin tissues indicates that the
protein products of this gene are useful for the diagnosis and/or treatment of
developmental disorders and malignant neoplasias. Likewise, expression within fetal
tissue and other cellular sources marked by proliferating cells indicates that this
protein may play a role in the regulation of cellular division, and may show utility in
the diagnosis and treatment of cancer and other proliferative disorders. Similarly, fetal
development also involves decisions involving cell differentiation and/or apoptosis in
pattern formation. Thus, this protein may also be involved in apoptosis or tissue
differentiation and could again be useful in cancer therapy.

Alternatively, the tissue distribution in fetal skin tissue indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
treatment, diagnosis, and/or prevention of various skin disorders including congenital
disorders (e.g., nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine
syndrome), integumentary tumors (e.g., keratoses, Bowen's disease, basal cell
carcinoma, squamous cell carcinoma, malignant melanoma, Paget's disease, mycosis
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (e.g., wounds,
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, urticaria, eczema,
photosensitivity, autoimmune disorders (e.g., lupus erythematosus, vitiligo,
dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae,
erythema, petechiae, purpura, and xanthelasma. Moreover, such disorders may
predispose an individual to viral and bacterial infections of the skin (e.g.,
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis,
erysipelas, impetigo, tinea, athlete's foot, and ringworm). The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO: 18 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 to 1116 of SEQ ID NO: 18, b is an
integer of 15 to 1130, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO: 18, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

The translation product of this gene shares sequence homology with the 
bacterial gufA gene, as well as a C. elegans protein of unknown function. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRI 
GRAGLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPGGSSWRRI 
ALLILAITIHNVPEGLAVGVGFGAIEKTASATFESARNLAIGIGIQNFPEGLAVS 
LPLRGAGFSTWRAFWYGQLSGMVEPLAGVFGAFAVVLAEPILPYALAFAAG 
AMVYVVMDDIIPEAQISGNGKLASWASILGFVVMMSLDVGLG (SEQ ID 
NO:217), MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRIGRA (SEQ 
ID NO:218), GLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPG
(SEQ ID NO:219),
GSSWRRIALLILAITIHNVPEGLAVGVGFGAIEKTASATFESAR (SEQ ID
NO:220), NLAIGIGIQNFPEGLAVSLPLRGAGFSTWRAFWYGQLSGMVEP
(SEQ ID NO:221), LAGVFGAFAVVLAEPILPYALAFAAGAMVYVVM 
DDIIPEAQIS (SEQ ID NO:222), and/or GNGKLASWASILGFVVMMSLDVGLG 
(SEQ ID NO:223). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

This gene is expressed primarily in cells of the immune system, particularly
macrophages.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, disorders of the immune system, such as AIDS, as well as
inflammatory disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues or cell
types (e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph,
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and treatment of a variety of
immune system disorders. Expression of this gene product in immune cells such as
macrophages indicates a role in the regulation of the proliferation; survival;
differentiation; and/or activation of potentially all hematopoietic cell lineages,
including blood stem cells. This gene product may be involved in the regulation of
cytokine production, antigen presentation, or other processes that may also suggest a
usefulness in the treatment of cancer (e.g., by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in macrophages also strongly indicates a role
for this protein in immune function and immune surveillance. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 19 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 869 of SEQ ID NO: 19, b is an 
integer of 15 to 883, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 19, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in spleen metastatic melanoma tissue, as
well as in embryonic tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting the spleen or immune system, developmental 
disorders, and cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues or cell 
types (e.g., spleen, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
117 as residues: Asn-37 to Lys-44, Ser-73 to Glu-78, Ala-103 to Ser-111.

The tissue distribution in spleen metastatic melanoma and embryonic tissues
indicates that the protein products of this gene are useful for the diagnosis and/or 
treatment of disorders affecting the spleen, including cancers of the spleen, as well as 
cancers of other tissues where expression has been observed. Furthermore, expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
indicates that this protein may play a role in the regulation of cellular division, and 
may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Similarly, embryonic development also involves decisions involving cell 
differentiation and/or apoptosis in pattern formation. Thus, this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:20 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 975 of SEQ ID NO:20, b is an
integer of 15 to 989, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:20, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes and neutrophils. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders affecting the immune 
system, such as AIDS. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
118 as residues: Ser-12 to Asp-20, Gly-22 to Gly-32, Ala-49 to Thr-57.

The tissue distribution in monocytes and neutrophils indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of immune system 
disorders, including AIDS. Furthermore, expression of this gene product in 
monocytes and neutrophils suggests a role in the regulation of the proliferation; 
survival; differentiation; and/or activation of potentially all hematopoietic cell 
lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in monocytes and neutrophils also strongly
suggests a role for this protein in immune function and immune surveillance. The
protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:21 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 481 of SEQ ID NO:21, b
is an integer of 15 to 495, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:21, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 12

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: disorders affecting the immune
system. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
119 as residues: Glu-35 to Trp-42.

The tissue distribution suggests that the protein product of this clone is useful
for the diagnosis and treatment of a variety of immune system disorders. Expression
of this gene product in monocytes suggests a role in the regulation of the
proliferation; survival; differentiation; and/or activation of potentially all
hematopoietic cell lineages, including blood stem cells. This gene product may be
involved in the regulation of cytokine production, antigen presentation, or other
processes that may also suggest a usefulness in the treatment of cancer (e.g., by
boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in monocytes also strongly suggests a role for
this protein in immune function and immune surveillance. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:22 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 2303 of SEQ ID NO:22, b
is an integer of 15 to 2317, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:22, and where b is greater than or equal to a
+ 14.
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FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: disorders of the immune system.
Similarly, polypeptides and antibodies directed to those polypeptides are useful to
provide immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune system, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., immune, cancerous and wounded tissues)
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid)
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in monocytes indicates that the protein products of this 
clone are useful for the diagnosis and/or treatment of disorders of the immune system. 
Expression of this gene product in monocytes suggests a role in the regulation of the
proliferation, survival, differentiation, and/or activation of potentially all
hematopoietic cell lineages, including blood stem cells. This gene product may be
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in monocytes also strongly suggests a role for
this protein in immune function and immune surveillance. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:23 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1712 of SEQ ID NO:23, b 
is an integer of 15 to 1726, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:23, and where b is greater than or equal to a 
+ 14. 
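
The a-b formula above can be read as a simple membership test on a pair of nucleotide positions. The following is a minimal sketch only, assuming the stated bounds are inclusive; the function names `is_in_ab_range` and `count_ab_fragments` are hypothetical and do not appear in the specification. The length 1726 is the upper bound given in the text for SEQ ID NO:23.

```python
def is_in_ab_range(a: int, b: int, seq_len: int) -> bool:
    """Check whether positions a..b satisfy the a-b formula.

    Assumes inclusive bounds: a in 1..(seq_len - 14), b in 15..seq_len,
    and b >= a + 14, i.e. the fragment spans at least 15 nucleotides.
    """
    return 1 <= a <= seq_len - 14 and 15 <= b <= seq_len and b >= a + 14


def count_ab_fragments(seq_len: int) -> int:
    """Count the distinct (a, b) pairs allowed by the formula.

    For each a there are (seq_len - 13 - a) valid values of b, which sums
    to n * (n + 1) / 2 with n = seq_len - 14.
    """
    n = seq_len - 14
    return n * (n + 1) // 2 if n > 0 else 0


# Example with the bounds stated for SEQ ID NO:23 (a up to 1712, b up to 1726).
SEQ_LEN_23 = 1726
print(is_in_ab_range(1, 15, SEQ_LEN_23))       # shortest fragment at the 5' end
print(is_in_ab_range(1712, 1726, SEQ_LEN_23))  # shortest fragment at the 3' end
print(is_in_ab_range(100, 110, SEQ_LEN_23))    # too short: only 11 nucleotides
```

The count grows quadratically with sequence length, which is consistent with the statement that listing every related sequence would be cumbersome.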

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

The translation product of this gene shares sequence homology with a gene 
from C. elegans of unknown function. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRPITYVLLAG (SEQ ID NO:224). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. The gene encoding the 
disclosed cDNA is thought to reside on chromosome 11. Accordingly,
polynucleotides related to this invention are useful as a marker in linkage analysis for
chromosome 11.

It has been discovered that this gene is expressed primarily in fetal lung, liver, 
spleen and heart tissues, as well as adult liver, bladder, endometrial stromal cells, 
synovium, colon cancer, smooth muscle, keratinocytes, and the bone marrow derived 
cell line RS4;11.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: disorders of the musculoskeletal
system, and cancers of the immune system. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the musculoskeletal and immune systems,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., immune, musculoskeletal, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in tissues of the immune system indicates that the
protein products of this clone are useful for treating proliferative disorders of immune
system precursor cells. Alternatively, the tissue distribution in smooth muscle and
heart tissue indicates that the protein product of this gene is useful for the diagnosis
and treatment of conditions and pathologies of the cardiovascular system, such as
heart disease, restenosis, atherosclerosis, stroke, angina, thrombosis, and wound
healing. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:24 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 515 of SEQ ID NO:24, b
is an integer of 15 to 529, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:24, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GTSLTAPLLEFLLALYFLFADAMQLNDKWQGLCWP
(SEQ ID NO:225). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in T-cells, fetal 
spleen and infant brain tissues, and to a lesser extent in many other tissues including 
melanocytes, lung cancer, macrophages, dendritic cells, stromal cells, adrenal gland 
and others. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: inflammation and autoimmunity, 
developing tissues. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
122 as residues: Ser-46 to Gly-51. 

The tissue distribution in T-cells and other immune cells indicates that the 
protein products of this clone are useful for treating diseases involving the activation 
of T-cells, including inflammation and autoimmune diseases. Alternatively, the tissue 
distribution in a wide range of fetal tissues suggests that this protein may play a role 
in the regulation of cellular division, and may show utility in the diagnosis and
treatment of cancer and other proliferative disorders. Similarly, fetal development
also involves decisions involving cell differentiation and/or apoptosis in pattern
formation. Thus, this protein may also be involved in apoptosis or tissue
differentiation and could again be useful in cancer therapy. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:25 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1741 of SEQ ID NO:25, b
is an integer of 15 to 1755, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:25, and where b is greater than or equal to a
+ 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: LANFZCSDCAQTVLFVLZFZILVFTYEIPF (SEQ ID
NO:226). Polynucleotides encoding these polypeptides are also encompassed by the
invention. The gene encoding the disclosed cDNA is thought to reside on
chromosome 13. Accordingly, polynucleotides related to this invention are useful as a
marker in linkage analysis for chromosome 13. Recently another group published this
gene, referring to it as CLN5 (See Genbank Accession No.: 3342386).

It has been discovered that this gene is expressed primarily in placental tissue, 
12 week embryos, and tumors including testes, tongue and pharynx, and to a lesser 
extent in adipose tissue, tonsils, melanocytes, fetal spleen, macrophages, T-cells,
amniotic cells, and brain tissue.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: tumors, particularly of the tongue
and throat, and neurodegenerative disorders. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the neural and digestive systems, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., tongue, throat, brain, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
123 as residues: Pro-44 to Ala-60, Val-187 to Thr-193, Lys-203 to Ala-210, Thr-212
to Cys-219.
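
The residue ranges listed for the preferred epitopes follow a regular "Xaa-N to Yaa-M" pattern, so they can be turned into numeric intervals for further handling. The following is an illustrative sketch only; the helper name `parse_epitope_ranges` is hypothetical, and the code assumes three-letter residue codes and 1-based positions as written in the text.

```python
import re

# A three-letter residue code, a hyphen, and a 1-based position, e.g. "Pro-44".
RESIDUE = re.compile(r"([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text: str) -> list[tuple[str, int, str, int]]:
    """Parse 'Pro-44 to Ala-60, Val-187 to Thr-193' into tuples.

    Each tuple is (start_residue, start_pos, end_residue, end_pos).
    Raises ValueError if a comma-separated part does not contain
    exactly two residue tokens.
    """
    ranges = []
    for part in text.split(","):
        found = RESIDUE.findall(part)
        if len(found) != 2:
            raise ValueError(f"cannot parse epitope range: {part.strip()!r}")
        (start_aa, start_pos), (end_aa, end_pos) = found
        ranges.append((start_aa, int(start_pos), end_aa, int(end_pos)))
    return ranges


# Ranges as listed in the text for SEQ ID NO. 123.
listed = "Pro-44 to Ala-60, Val-187 to Thr-193, Lys-203 to Ala-210, Thr-212 to Cys-219"
for start_aa, start, end_aa, end in parse_epitope_ranges(listed):
    # Length is inclusive of both end residues.
    print(f"{start_aa}-{start} to {end_aa}-{end}: {end - start + 1} residues")
```

Parsing into integer positions makes it straightforward to check, for example, that each range lies within the length of the corresponding polypeptide.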

The tissue distribution in tongue and pharynx carcinoma tissue indicates that
the protein products of this clone are useful for diagnosing and/or treating oral
cancers, including tumors of the throat and tongue. Furthermore, the tissue
distribution in brain tissue suggests that the protein product of this clone is useful for
the detection/treatment of neurodegenerative disease states and behavioural disorders
such as neuronal ceroid lipofuscinoses (NCLs), Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia,
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, the gene or gene product may also play
a role in the treatment and/or detection of developmental disorders associated with the
developing embryo, or sexually-linked disorders. The protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:26 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1737 of SEQ ID NO:26, b
is an integer of 15 to 1751, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:26, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences:
QAWHEVGGGVRRCWFVLGERRAGSLLSASYGTFAMPG
MVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYL (SEQ ID NO:227),
and/or PGMVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYLMHRGKLD
CAGGALLSSYLIVLMILLAVVICTVSAIMCVSMRGTICNPGPRKSMSKLLYIRL
ALFFPEMVWASLGAAWVADGVQCD (SEQ ID NO:228). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed in activated neutrophils,
infant brain tissue and primary dendritic cells.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: disorders of the immune system,
and neurodegenerative disorders. Similarly, polypeptides and antibodies directed to
those polypeptides are useful to provide immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune and neural systems, expression of this
gene at significantly higher or lower levels may be detected in certain tissues or cell
types (e.g., immune, brain, cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
124 as residues: Pro-47 to Met-53, Ser-130 to Ser-138.

The tissue distribution in neutrophils and primary dendritic cells indicates that
the protein products of this clone are useful for diagnosing and/or treating immune
system disorders. Expression of this gene product in neutrophils and primary dendritic
cells suggests a role in the regulation of the proliferation, survival, differentiation,
and/or activation of potentially all hematopoietic cell lineages, including blood stem
cells. This gene product may be involved in the regulation of cytokine production,
antigen presentation, or other processes that may also suggest a usefulness in the
treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues. Therefore it may be also
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in neutrophils and primary dendritic cells also
strongly suggests a role for this protein in immune function and immune surveillance.

Alternatively, the tissue distribution in brain tissue suggests that the protein
product of this clone is useful for the detection/treatment of neurodegenerative
disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia,
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, the gene or gene product may also play
a role in the treatment and/or detection of developmental disorders associated with the
developing embryo, or sexually-linked disorders. The protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:27 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1198 of SEQ ID NO:27, b
is an integer of 15 to 1212, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:27, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and inflammatory
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
125 as residues: Gln-17 to Ser-24.

The tissue distribution in neutrophils indicates that the protein products of this
clone are useful for the diagnosis and/or treatment of immune and inflammatory
disorders. Expression of this gene product in neutrophils suggests a role in the
regulation of the proliferation, survival, differentiation, and/or activation of
potentially all hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes that may also suggest a usefulness in the treatment of
cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues. Therefore it may be also
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in neutrophils also strongly suggests a role for
this protein in immune function and immune surveillance. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:28 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:28, b
is an integer of 15 to 1112, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:28, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 19

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HERNCFPMWLNHSAFPPV (SEQ ID NO:229). 
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and inflammatory
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in neutrophils indicates that the protein products of this
clone are useful for the diagnosis and/or treatment of immune and inflammatory
disorders. Expression of this gene product in neutrophils suggests a role in the
regulation of the proliferation, survival, differentiation, and/or activation of
potentially all hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes that may also suggest a usefulness in the treatment of
cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues. Therefore it may be also
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Expression of this gene product in neutrophils also strongly suggests a role for
this protein in immune function and immune surveillance. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:29 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 734 of SEQ ID NO:29, b
is an integer of 15 to 748, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:29, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GWTRENDHRALSKAGIGSAEIQPSNLRVGSAKDLGKPW
AGKLLLLSSCLLFFSLGVLYRGQMLAPPLQEDWKGGVKDSDLIDDSSASPIPP
SYLEYKAALYPFSEHKSVRNATDSLTFFLVTDHFLDNQDSQ (SEQ ID
NO:230), GWTRENDHRALSKAGIGSAEIQPSNLRVGSAKDLGKPWAGKLLLL
(SEQ ID NO:231),
SSCLLFFSLGVLYRGQMLAPPLQEDWKGGVKDSDLIDDSSASPIPP (SEQ ID
NO:232), and/or SYLEYKAALYPFSEHKSVRNATDSLTFFLVTDHFLDNQDSQ
(SEQ ID NO:233). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in ovarian cancer
tissue, and to a lesser extent in other tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: ovarian cancer. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the ovaries,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., reproductive, cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
127 as residues: Thr-20 to Gly-27, Gly-32 to Phe-41.

The tissue distribution in ovarian cancer tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of ovarian cancer, 
as well as cancers of other tissues where expression has been observed. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:30 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 764 of SEQ ID NO:30, b
is an integer of 15 to 778, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:30, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
transduction pathway. The gamma activating sequence (GAS) is a promoter element 
found upstream of many genes which are involved in the Jak-STAT pathway. The 
Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LKFHQESLSGD (SEQ ID NO:234). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast-growing 
tissues such as immune/hematopoietic tissues, early developmental stage human 
tissues, and tumor tissues, and to a lesser extent in some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory diseases, and tumorigenesis. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune/hematopoietic system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
128 as residues: Glu-60 to Arg-65. 

The tissue distribution in immune tissues, in conjunction with the biological 
activity data, indicates that the protein products of this clone are useful for the 
diagnosis and/or treatment of growth disorders, immune and inflammatory diseases,
and tumorigenesis. Furthermore, expression within embryonic tissue and other 
cellular sources marked by proliferating cells suggests that this protein may play a 
role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. Similarly, embryonic
development also involves decisions involving cell differentiation and/or apoptosis in
pattern formation. Thus, this protein may also be involved in apoptosis or tissue
differentiation and could again be useful in cancer therapy. The protein, as well as
antibodies directed against the protein, may show utility as tumor markers and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:31 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 1310 of SEQ ID NO:31, b
is an integer of 15 to 1324, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:31, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:235). 
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in ovarian cancer
tissue, and to a lesser extent in fetal liver/spleen and retinal tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: ovarian cancer, immune disorders,
and retinal disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the ovaries, immune and ocular systems, expression of this gene at
significantly higher or lower levels may be detected in certain tissues or cell types
(e.g., reproductive, ovaries, retina, immune, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in ovarian cancer tissue indicates that the protein
products of this clone are useful for the diagnosis and/or treatment of ovarian cancer,
as well as cancers of other tissues where expression has been observed. The tissue
distribution also suggests that the protein product of this clone is useful for the
diagnosis and treatment of a variety of immune system disorders. Expression of this
gene product in fetal liver/spleen suggests a role in the regulation of the proliferation,
survival, differentiation, and/or activation of potentially all hematopoietic cell
lineages, including blood stem cells. This gene product may be involved in the
regulation of cytokine production, antigen presentation, or other processes that may
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune
responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as tumor markers
and/or immunotherapy targets for the above listed tissues. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. Alternatively, the tissue distribution in retinal tissue suggests that the protein
product of this clone is useful for the treatment and/or detection of eye disorders
including blindness, color blindness, impaired vision, short and long sightedness,
retinitis pigmentosa, retinitis proliferans, retinoblastoma, retinochoroiditis,
retinopathy, and retinoschisis. The protein, as well as antibodies directed against the
protein, may show utility as tumor markers and/or immunotherapy targets for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:32 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 725 of SEQ ID NO:32, b
is an integer of 15 to 739, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:32, and where b is greater than or equal to a
+ 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

The translation product of this gene shares sequence homology with a C.
elegans protein of unknown function (See Genbank Accession No.:
gnl|PID|e1348017). When tested against fibroblast cell lines, supernatants removed
from cells containing this gene activated the EGR1 assay. Thus, it is likely that this
gene activates fibroblast cells through a signal transduction pathway. Early growth
response 1 (EGR1) is a promoter associated with certain genes that induces various
tissues and cell types upon activation, leading the cells to undergo differentiation and
proliferation. The gene encoding the disclosed cDNA is thought to reside on
chromosome 17. Accordingly, polynucleotides related to this invention are useful as a
marker in linkage analysis for chromosome 17.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:236), 
and/or ALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSELELDDVVIT
NPHIEAILENEDWIEDASGLMSHCIAILM
SVSDIIVVAKRISPRVDDVVKSMYPPLDPKLLDAR (SEQ ID NO:237).
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast growing 
tissues such as early development stage human tissues, immune/hematopoietic 
tissues, melanocytes, and tumor tissues, and to a lesser extent in some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory disorders, skin and connective tissue disorders, and tumorigenesis.
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the fast 
growing tissues such as early development stage human tissues, 
immune/hematopoietic tissues, skin and connective tissue, and tumor tissues, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., musculoskeletal, skin, immune, developing, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
130 as residues: Pro-34 to Ser-43, Glu-54 to Ser-60. 

The tissue distribution suggests that the protein product of this clone is useful 
for the diagnosis and/or treatment of growth disorders, immune and inflammatory 
disorders, and tumorigenesis. Alternatively, the tissue distribution in melanocytes, in
conjunction with the observed biological activity data, suggests that the protein
product of this clone is useful for the treatment, diagnosis, and/or prevention of
various skin disorders including congenital disorders (e.g., nevi, moles, freckles,
Mongolian spots, hemangiomas, port-wine syndrome), integumentary tumors (e.g.,
keratoses, Bowen's disease, basal cell carcinoma, squamous cell carcinoma,
malignant melanoma, Paget's disease, mycosis fungoides, and Kaposi's sarcoma),
injuries and inflammation of the skin (e.g., wounds, rashes, prickly heat disorder,
psoriasis, dermatitis), atherosclerosis, urticaria, eczema, photosensitivity, autoimmune
disorders (e.g., lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma,
pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, purpura, and
xanthelasma.

Moreover, such disorders may predispose an individual to increased susceptibility to
viral and bacterial infections of the skin (e.g., cold sores, warts, chickenpox, molluscum
contagiosum, herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, athlete's foot,
and ringworm). The protein, as well as antibodies directed against the protein, may show
utility as tumor markers and immunotherapy targets for the above listed tumors and
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:33 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 1448 of SEQ ID NO:33, b
is an integer of 15 to 1462, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:33, and where b is greater than or equal to a 
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 24

When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
transduction pathway. The gamma activating sequence (GAS) is a promoter element
found upstream of many genes which are involved in the Jak-STAT pathway. The
Jak-STAT pathway is a large signal transduction pathway involved in the
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT
pathway, reflected by the binding of the GAS element, can be used to indicate
proteins involved in the proliferation and differentiation of cells.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

DVESRGPSARCLPVVPGSLLPGLEPATKLMPGGLAPGHG
APVRELLLPLLSQPTLGSLWDSLRHCSLLCNPLSCVPALEAPPSLVSLGCSGGC
PRLSLAGSASPFPFLTALLSLLNTLAQIHKGLCGQLAAILAAPGLQNYFLQCVA
PGAAPHLTPFSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMAL
ALLSRLLPGSEYLTHELLLSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPR
CGQGTLLAQACQDLPSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLP
MPTEPLLPTDWPFLH (SEQ ID NO:238),
DVESRGPSARCLPVVPGSLLPGLEPATKLMPGGLAPGHGAPVRE (SEQ ID
NO:239), LLLPLLSQPTLGSLWDSLRHCSLLCNPLSCVPALEAPPSLVSLGC
(SEQ ID NO:240), SGGCPRLSLAGSASPFPFLTALLSLLNTLAQIHKGLCGQLAAILA
(SEQ ID NO:241), APGLQNYFLQCVAPGAAPHLTPFSAWALRHEYHLQYLALALAQK
(SEQ ID NO:242), AAALQPLPATHAALYHGMALALLSRLLPGSEYLTHELLLSCVFR
(SEQ ID NO:243), LEFLPERTSGGPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDL
(SEQ ID NO:244), and/or
PSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLH
(SEQ ID NO:245). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in hematopoietic
tissues and fetal heart tissue, and to a lesser extent in brain and gall bladder tissues, 
and some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and inflammatory 
disorders, cardiovascular disorders, and growth disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the hematopoietic and vascular systems,
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., vascular, immune, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
131 as residues: Tyr-88 to Trp-102, Asp-105 to Ser-110. 

The tissue distribution in hematopoietic tissues, in conjunction with the
observed biological activity data, indicates that the protein products of this clone are
useful for the diagnosis and/or treatment of immune and inflammatory disorders and 
growth disorders. Alternatively, the tissue distribution in fetal heart tissue indicates 
that the protein product of this gene is useful for the diagnosis and treatment of 
conditions and pathologies of the cardiovascular system, such as heart disease,
restenosis, atherosclerosis, stroke, angina, thrombosis, and wound healing. The protein, as
well as antibodies directed against the protein, may show utility as tumor markers
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:34 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 2801 of SEQ ID NO:34, b
is an integer of 15 to 2815, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:34, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 25

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: VGSVLGAFLTFPGLRLAQTHRDALT (SEQ ID NO:246).
Polynucleotides encoding these polypeptides are also encompassed by the invention.
The gene encoding the disclosed cDNA is thought to reside on chromosome 19.
Accordingly, polynucleotides related to this invention are useful as a marker in
linkage analysis for chromosome 19.

It has been discovered that this gene is expressed primarily in human pituitary
tissue.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: hyperpituitarism and
hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., endocrine,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. This gene is found on the short arm
of chromosome 19 and, therefore, is useful as a chromosome marker.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
132 as residues: Met-1 to Pro-6, Gln-89 to Ala-94, Pro-161 to Cys-173.

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of pituitary disorders. More 
generally, the tissue distribution in pituitary tissue suggests that the protein product of
this clone is useful for the detection, treatment, and/or prevention of various
endocrine disorders and cancers, particularly Addison's disease, Cushing's
Syndrome, and disorders and/or cancers of the pancreas (e.g. diabetes mellitus),
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-,
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism), hypothalamus, and
testes. The protein, as well as antibodies directed against the protein, may show utility as
tumor markers and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:35 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 and 1064 of SEQ ID NO:35, b
is an integer of 15 to 1078, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:35, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 26

It has been discovered that this gene is expressed highly and specifically in 
placental and bone marrow cDNA libraries, and to a lesser extent in T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune, developmental and
reproductive disorders. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, developmental, reproductive, cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

The tissue distribution in bone marrow and placental tissue indicates that the
protein products of this clone are useful for the diagnosis and/or treatment of immune
and reproductive disorders. The tissue distribution in bone marrow suggests that the
protein product of this clone is useful for the treatment and diagnosis of
hematopoietic related disorders such as anemia, pancytopenia, leukopenia,
thrombocytopenia or leukemia. The uses include bone marrow cell ex vivo culture,
bone marrow transplantation, bone marrow reconstitution, radiotherapy or
chemotherapy of neoplasia. The gene product may also be involved in lymphopoiesis;
therefore, it can be used in immune disorders such as infection, inflammation, allergy,
immunodeficiency, etc. In addition, this gene product may have commercial utility in
the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types.

Alternatively, the tissue distribution in placental tissue suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of disorders 
of the placenta. Specific expression within the placenta suggests that this gene
product may play a role in the proper establishment and maintenance of placental
function. Alternatively, this gene product may be produced by the placenta and then
transported to the embryo, where it may play a crucial role in the development and/or
survival of the developing embryo or fetus.

Expression of this gene product in a vascular-rich tissue such as the placenta
also suggests that this gene product may be produced more generally in endothelial
cells or within the circulation. In such instances, it may play more generalized roles in
vascular function, such as in angiogenesis. It may also be produced in the vasculature
and have effects on other cells within the circulation, such as hematopoietic cells. It
may serve to promote the proliferation, survival, activation, and/or differentiation of
hematopoietic cells, as well as other cells throughout the body. The protein, as well as
antibodies directed against the protein, may show utility as tumor markers and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:36 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 and 1203 of SEQ ID NO:36, b
is an integer of 15 to 1217, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:36, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences:

LECTDTIMVHCSLKLLSPSDXSHSASQVAKTRGVHHXTQ
LBFKVFFVXMGSHSTKYXSIRPGLLP (SEQ ID NO:247). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in human prostate
and smooth muscle tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: disorders in the prostate gland,
vascular and connective tissues. Similarly, polypeptides and antibodies directed to
those polypeptides are useful to provide immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the male reproductive and urinary system and vascular 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., reproductive, vascular, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in prostate and smooth muscle tissues indicates that the
protein products of this clone are useful for the diagnosis and/or treatment of prostate
gland, vascular and connective tissue disorders. The tissue distribution in smooth
muscle tissue indicates that the protein product of this gene is useful for the diagnosis
and treatment of conditions and pathologies of the cardiovascular system, such as
heart disease, restenosis, atherosclerosis, stroke, angina, thrombosis, and wound
healing. The expression in the prostate tissue may indicate that the gene or its products
can be used in disorders of the prostate, including inflammatory disorders, such as
chronic prostatitis, granulomatous prostatitis and malacoplakia, prostatic hyperplasia
and prostate neoplastic disorders, including adenocarcinoma, transitional cell
carcinomas, ductal carcinomas, squamous cell carcinomas, or as hormones or factors
with systemic or reproductive functions. The protein, as well as antibodies directed
against the protein, may show utility as tumor markers and/or immunotherapy targets
for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:37 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 1268 of SEQ ID NO:37, b
is an integer of 15 to 1282, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:37, and where b is greater than or equal to a
+ 14.




FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: ESSFVPPAAHSSLC (SEQ ID NO:248). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human pituitary
tissue.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: hyperpituitarism and 
hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., endocrine, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of pituitary gland disorders 
such as hypopituitarism and hyperpituitarism. More generally, the tissue distribution 
in pituitary tissue suggests that the protein product of this clone is useful for the 
detection, treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's disease, Cushing's Syndrome, and disorders and/or cancers of 
the pancreas (e.g. diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-,
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper-,
hypoparathyroidism), hypothalamus, and testes. The protein, as well as antibodies
directed against the protein, may show utility as tumor markers and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 3 8 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 545 of SEQ ID NO:38, b 
is an integer of 15 to 559, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:38, and where b is greater than or equal to a 
+ 14. 
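
As an illustration only, the a-b formula above can be read as a membership test over start and end positions. The following Python sketch assumes that both bounds are inclusive, that positions are 1-based, and that the minimum fragment length of 15 nucleotides follows from b being at least a + 14. The function names are hypothetical and are not part of the disclosure.

```python
def is_excluded_range(a, b, seq_len, min_len=15):
    """Return True if positions a..b fall within the a-b formula.

    Assumptions (not stated in the text): bounds are inclusive and
    positions are 1-based. For SEQ ID NO:38, seq_len is 559, so a may
    run from 1 to 545 and b from 15 to 559.
    """
    max_a = seq_len - (min_len - 1)  # 545 when seq_len is 559
    return (
        1 <= a <= max_a
        and min_len <= b <= seq_len
        and b >= a + (min_len - 1)   # b is greater than or equal to a + 14
    )


def count_excluded_ranges(seq_len, min_len=15):
    """Count every (a, b) pair satisfying the formula by direct enumeration."""
    total = 0
    for a in range(1, seq_len + 1):
        for b in range(1, seq_len + 1):
            if is_excluded_range(a, b, seq_len, min_len):
                total += 1
    return total
```

Under these assumptions, the same test applies to the other sequences in this section by substituting the sequence length, for example 803 for SEQ ID NO:39.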


FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

LLPGQQEATQCVEAGAGEGALTPMCPCRQEQFVDLYKEF
EPSLVNSTVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVPHPH
RARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTASLASWTSLWSSS
WSLPRSCSWTSAWRSWPTASCSSSWGPRS (SEQ ID NO:249),
LLPGQQEATQCVEAGAGEGALTPMCPCRQEQFVDLYKEFEPSLVN (SEQ ID NO:250),
STVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVP (SEQ ID NO:251),
HPHRARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTA (SEQ ID NO:252),
and/or SLASWTSLWSSSWSLPRSCSWTSAWRSWPTASCSSSWGPRS (SEQ ID NO:253).
Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in human pituitary
and breast tissues, and to a lesser extent in endometrial and ovarian cancer tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: hyperpituitarism and
hypopituitarism, and cancers of the female reproductive system. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the endocrine and
reproductive systems, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., endocrine, reproductive,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
136 as residues: Ser-3 to Lys-8.

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of disorders in the pituitary 
gland. More generally, the tissue distribution in pituitary tissue suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention
of various endocrine disorders and cancers, particularly Addison's disease, Cushing's
Syndrome, and disorders and/or cancers of the pancreas (e.g., diabetes mellitus),
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g., hyper-,
hypothyroidism), parathyroid (e.g., hyper-, hypoparathyroidism), hypothalamus, and
testes. Alternatively, the tissue distribution in breast tissue and cancerous tissues of
the endometrium and ovaries suggests that the translation product of this gene is
useful for the detection and/or treatment of disorders and cancers of the female
reproductive system, as well as cancers of other tissues where expression has been
observed. The protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:39 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 789 of SEQ ID NO:39, b
is an integer of 15 to 803, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:39, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRNILSFIKCVIHNFWIPKESNEITIIINPYRETVCFSVEPVKKIFNY
(SEQ ID NO:254). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in human synovial 
sarcoma tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the skeletal system,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., skeletal, connective, cancerous and wounded tissues)
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid)
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
137 as residues: Thr-29 to Pro-34.

The tissue distribution in synovial sarcoma tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of diseases of the 
synovium. In addition, the expression of this gene product in synovium suggests a
role in the detection and treatment of disorders and conditions affecting the skeletal
system, in particular osteoporosis, as well as disorders afflicting connective tissues
(e.g., arthritis, trauma, tendonitis, chondromalacia and inflammation), such as in the
diagnosis or treatment of various autoimmune disorders such as rheumatoid arthritis,
lupus, scleroderma, and dermatomyositis, as well as dwarfism, spinal deformation,
and specific joint abnormalities, as well as chondrodysplasias (i.e., spondyloepiphyseal
dysplasia congenita, familial arthritis, Atelosteogenesis type II, metaphyseal
chondrodysplasia type Schmid). The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:40 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1496 of SEQ ID NO:40, b
is an integer of 15 to 1510, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:40, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 31

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LVVLFASSNSRYLKYFFLVPLILGSAW (SEQ ID NO:255). 
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in human
rhabdomyosarcoma and fetal liver/spleen tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: malignant neoplasms and 
hematopoiesis. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the skeletal and immune systems, expression of this gene at
significantly higher or lower levels may be detected in certain tissues or cell types
(e.g., musculoskeletal, immune, cancerous and wounded tissues) or bodily fluids
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
138 as residues: Gly-29 to Thr-35.

The tissue distribution in rhabdomyosarcoma and fetal liver/spleen tissues
indicates that the protein products of this clone are useful for diagnosis and treatment
of skeletal and immune disorders. The expression in rhabdomyosarcoma tissue
suggests that the protein product of this clone is useful for the detection, treatment,
and/or prevention of various muscle disorders, such as muscular dystrophy,
cardiomyopathy, fibroids, myomas, and rhabdomyosarcomas. Alternatively,
expression of this gene product in fetal liver/spleen tissue suggests a role in
the regulation of the proliferation, survival, differentiation, and/or activation of
potentially all hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes that may also suggest a usefulness in the treatment of
cancer (e.g., by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above-listed tissues. Therefore, it may also be
used as an agent for immunological disorders including arthritis, asthma, immune
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types. The protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:41 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1081 of SEQ ID NO:41, b
is an integer of 15 to 1095, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:41, and where b is greater than or equal to a
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

It has been discovered that this gene is expressed primarily in fibrosarcoma
tissue.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: fibrosarcoma. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the connective
tissue system, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., musculoskeletal, cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or
spinal fluid) taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
139 as residues: Ser-34 to Gln-40, Gly-42 to Glu-48, Tyr-56 to Leu-62. 

The tissue distribution only in fibrosarcoma tissue suggests that the protein
product of this clone is useful for the treatment, diagnosis and/or prognosis of
fibrosarcoma, or other disorders related to fibrous tissue including fibroma,
fibromatosis, fibromyoma, fibromyositis, fibrosis and fibrositis. Likewise, the
expression in fibrosarcoma tissue suggests that the protein product of this clone is
useful for the detection, treatment, and/or prevention of various muscle disorders,
such as muscular dystrophy, cardiomyopathy, myomas, and rhabdomyosarcomas.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:42 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1148 of SEQ ID NO:42, b
is an integer of 15 to 1162, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:42, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 33

It has been discovered that this gene is expressed primarily in Hodgkin's
lymphoma and breast cancer tissues, and to a lesser extent in stromal cells and brain
tissue.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: lymphoma, breast cancer, and
neurological disorders. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune and nervous systems, expression of this gene at
significantly higher or lower levels may be detected in certain tissues or cell types
(e.g., immune, neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph,
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
140 as residues: Pro-22 to Lys-29. 

The tissue distribution in Hodgkin's lymphoma, brain and breast cancer tissues
suggests a role in the treatment, diagnosis and/or prognosis of breast cancer; immune
and hematopoietic disorders including arthritis, asthma, immunodeficiency diseases,
leukemia and Hodgkin's lymphoma; and neurodegenerative disease states and
behavioral disorders such as Alzheimer's Disease, Parkinson's Disease, Huntington's
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder
and panic disorder. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:43 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 643 of SEQ ID NO:43, b 
is an integer of 15 to 657, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:43, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HEWKCKQKYSEGSGNTRIGN (SEQ ID NO:256).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in chronic
synovitis tissue, and to a lesser extent in fetal kidney and testes tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: synovitis, renal disorders and male
infertility. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the connective tissue system, the renal system, and the male reproductive system,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., skeletal, renal, reproductive, cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
141 as residues: Met-33 to Pro-39, Ser-74 to Trp-79. 

The tissue distribution of this gene in chronic synovitis, testes, and kidneys
suggests a role in the treatment, diagnosis and prognosis of synovial membrane
disorders including synovitis; renal disorders including kidney failure, renal colic,
renal diabetes, hypertension, osteodystrophy, tubular acidosis and kidney stones; and
male infertility. Furthermore, the tissue distribution in testes tissue indicates that
the protein product of this clone is useful for the treatment and/or diagnosis of
conditions concerning proper testicular function (e.g., endocrine function, sperm
maturation), as well as cancer. Therefore, this gene product is useful in the treatment
of male infertility and/or impotence. This gene product is also useful in assays
designed to identify binding agents, as such agents (antagonists) are useful as male
contraceptive agents. Similarly, the protein is believed to be useful in the treatment
and/or diagnosis of testicular cancer. The testes are also a site of active gene
expression of transcripts that may be expressed, particularly at low levels, in other
tissues of the body. Therefore, this gene product may be expressed in other specific
tissues or organs where it may play related functional roles in other processes, such as
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. In addition, the expression of this gene product in
synovium suggests a role in the detection and/or treatment of disorders and
conditions affecting the skeletal system, in particular osteoporosis, as well as
disorders afflicting connective tissues (e.g., arthritis, trauma, tendonitis,
chondromalacia and inflammation), such as in the diagnosis or
treatment of various autoimmune disorders such as rheumatoid arthritis, lupus,
scleroderma, and dermatomyositis, as well as dwarfism, spinal deformation, and
specific joint abnormalities, as well as chondrodysplasias (i.e., spondyloepiphyseal
dysplasia congenita, familial arthritis, Atelosteogenesis type II, metaphyseal
chondrodysplasia type Schmid). The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:44 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1141 of SEQ ID NO:44, b
is an integer of 15 to 1155, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:44, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LLPLCFLGPRQVLEEFPSIV (SEQ ID NO:257). 
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in brain tissue, and
to a lesser extent in osteoclastoma and testes tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurological disorders and male
reproductive disorders. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous system and the male reproductive system, expression of
this gene at significantly higher or lower levels may be detected in certain tissues or
cell types (e.g., neural, reproductive, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution of this gene in brain tissue suggests a role in the
diagnosis, prognosis and/or treatment of neurodegenerative disease states and
behavioral disorders such as Alzheimer's Disease, Parkinson's Disease, Huntington's
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder
and panic disorder. In addition, the gene or gene product may also play a role in the
treatment and/or detection of developmental disorders associated with the developing
embryo, or sexually-linked disorders. The protein, as well as antibodies directed against
the protein, may show utility as a tumor marker and/or immunotherapy target for the
above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:45 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:45, b
is an integer of 15 to 1112, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:45, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 36

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: PTRPSKHQEAGS (SEQ ID NO:258). Polynucleotides
encoding these polypeptides are also encompassed by the invention. The gene
encoding the disclosed cDNA is thought to reside on chromosome 3. Accordingly,
polynucleotides related to this invention are useful as a marker in linkage analysis for
chromosome 3.

It has been discovered that this gene is expressed primarily in adult and fetal 
heart tissue, and to a lesser extent in fetal lung and fetal liver/spleen tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: cardiovascular and immune
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the vascular and immune systems, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., vascular, immune,
pulmonary, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
143 as residues: Val-2 to Ser-14. 

The tissue distribution in heart, fetal liver and fetal spleen tissues suggests a
role in the treatment and/or diagnosis of cardiovascular disorders including
myocardial infarction, congestive heart failure, and coronary failure, as well as immune
disorders including autoimmune diseases, such as lupus, transplant rejection, allergic
reactions, arthritis, asthma, immunodeficiency diseases, leukemia, and AIDS.
Furthermore, the tissue distribution in adult and fetal heart tissue indicates that the
protein product of this gene is useful for the diagnosis and treatment of conditions and
pathologies of the cardiovascular system, such as heart disease, restenosis,
atherosclerosis, stroke, angina, thrombosis, and wound healing. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:46 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 4009 of SEQ ID NO:46, b
is an integer of 15 to 4023, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:46, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37

It has been discovered that this gene is expressed primarily in testes tissues.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: male infertility and reproductive
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the male reproductive system, expression of this gene at significantly higher or lower
levels may be detected in certain tissues or cell types (e.g., reproductive, cancerous
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial
fluid or spinal fluid) taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue from an
individual not having the disorder. 

The tissue distribution in testes tissues suggests a role in the treatment and/or 
diagnosis of male infertility, and testicular disorders including cancer. Furthermore, 
the tissue distribution in testes tissue indicates that the protein product of this clone is 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 
agents (antagonists) are useful as male contraceptive agents. Similarly, the protein is 
believed to be useful in the treatment and/or diagnosis of testicular cancer. The testes 
are also a site of active gene expression of transcripts that may be expressed, 
particularly at low levels, in other tissues of the body. Therefore, this gene product 
may be expressed in other specific tissues or organs where it may play related 
functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. The protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:47 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 528 of SEQ ID NO:47, b 
is an integer of 15 to 542, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:47, and where b is greater than or equal to a 
+ 14. 
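
By way of illustration only, the a-b formula above can be read as a membership test on a pair of nucleotide positions. The minimal Python sketch below assumes 1-based, inclusive positions and a sequence length of 542 nucleotides for SEQ ID NO:47 (inferred from the stated upper bound on b); the function names and the min_len parameter are hypothetical and are not part of the disclosure.

```python
def is_excluded_fragment(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if positions a..b (1-based, inclusive) satisfy the a-b formula.

    Assumed reading of the formula: 1 <= a <= seq_len - 14,
    15 <= b <= seq_len, and b >= a + 14 (a fragment of at least 15 residues).
    """
    max_a = seq_len - min_len + 1          # 528 when seq_len is 542
    return (
        1 <= a <= max_a
        and min_len <= b <= seq_len
        and b >= a + min_len - 1
    )


def count_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count every (a, b) pair that satisfies the formula for a given length."""
    return sum(
        1
        for a in range(1, seq_len - min_len + 2)
        for b in range(a + min_len - 1, seq_len + 1)
    )


if __name__ == "__main__":
    seq_len = 542  # assumed length of SEQ ID NO:47
    print(is_excluded_fragment(1, 15, seq_len))     # shortest fragment at the 5' end
    print(is_excluded_fragment(528, 542, seq_len))  # shortest fragment at the 3' end
    print(count_fragments(seq_len))
```

The same check applies to the other sequences described below by substituting the corresponding upper bound on b for seq_len.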
FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

It has been discovered that this gene is expressed primarily in apoptotic 
T-cells, and to a lesser extent in brain tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and neurological 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and nervous systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, neural, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
145 as residues: Glu-33 to Tyr-42. 

The tissue distribution in apoptotic T-cells suggests potential roles in the 
treatment and/or diagnosis of immune disorders, including immune and 
autoimmune diseases such as lupus, transplant rejection, allergic reactions, arthritis, 
asthma, immunodeficiency diseases, leukemia, and AIDS. Alternatively, expression 
in brain tissue suggests a role in the treatment and/or diagnosis of neurodegenerative 
disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's 
Disease, Huntington's Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. Furthermore, the tissue distribution in 
apoptotic T-cells indicates that the translation product of this gene may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. The protein, as well as antibodies directed against the protein, may show 
utility as a tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:48 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1481 of SEQ ID NO:48, b 
is an integer of 15 to 1495, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:48, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

The translation product of this gene shares sequence homology with 
phosphomannomutase, which is thought to be important in mannose metabolism. 

It has been discovered that this gene is expressed primarily in meningioma and 
testis tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: meningioma related diseases. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., neural, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
146 as residues: Ser-33 to Lys-43. 

The tissue distribution in meningioma, and the homology to 
phosphomannomutase, suggest that the protein product of this clone is useful for the 
diagnosis and/or intervention of meningioma related diseases. For example, the gene 
product can be used for preventing microbial infection of the meninges, for imaging 
conjugates, or as a secretory endocrine factor with systemic, central or peripheral 
nerve functions. The protein, as well as antibodies directed against the protein, may 
show utility as a tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:49 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 804 of SEQ ID NO:49, b 
is an integer of 15 to 818, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:49, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

It has been discovered that this gene is expressed primarily in tonsils, 
osteoclastoma and retinoic acid treated teratocarcinoma cells, and to a lesser extent in 
macrophages, female bladder, adipose tissue, myeloid progenitor cells, prostate tissue, 
and a number of other tissues and organs. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tonsils and osteoclast related 
diseases. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and bone systems, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
147 as residues: Glu-55 to Arg-61, Gln-84 to Ser-92, Ser-99 to Ser-104. 

The tissue distribution in tonsils and osteoclastoma suggests that the protein 
product of this clone is useful for the diagnosis and/or intervention of diseases related 
to tonsils or osteoclasts. Examples include tonsillitis, adenoids, peritonsillar abscess, 
neoplasms, and bone related disorders such as rickets, abnormalities of bone growth and 
modelling, fracture, osteonecrosis, and osteoporosis. Expression of this gene 
product in osteoclastoma suggests that it may play a role in the survival, proliferation, 
and/or growth of osteoclasts. Therefore, it may be useful in influencing bone mass in 
such conditions as osteoporosis. 

Alternatively, the expression of this gene product in tonsils suggests a role in 
the regulation of the proliferation, survival, differentiation, and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as antibodies directed against the protein, may show utility as a tumor marker 
and/or immunotherapy target for the above listed tissues. Therefore, it may also be 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. The protein, as well as antibodies directed against the protein, may show utility as a 
tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:50 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1697 of SEQ ID NO:50, b 
is an integer of 15 to 1711, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:50, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

It has been discovered that this gene is expressed primarily in resting T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: T-cell related disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in resting T-cells suggests that the protein product of 
this clone is useful for the diagnosis and/or intervention of T-cell related disorders, 
such as infection, inflammation, allergy, tissue/organ transplantation, and immune 
deficiency. Furthermore, the expression of this gene product in T-cells also 
strongly suggests a role for this protein in immune function and immune surveillance. 
The protein, as well as antibodies directed against the protein, may show utility as a 
tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:51 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 735 of SEQ ID NO:51, b 
is an integer of 15 to 749, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:51, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

The translation product of this gene shares weak sequence homology with 
Human metastasis suppressor KiSS-1 fragment, which is thought to be important in 
the diagnosis, prevention, staging and/or treatment of cancers, such as melanoma (See 
Accession No. W15789). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GQGPAGRWVRRLPCSRRAGGERGPHWGVWAGPQM 
SCGLXFGP (SEQ ID NO:259), WRTQGPMVLLWVVTCPATMLTEPQNPHLIGF 
VAYSGPSHTTQPHKYWLLLDGQADPAAAEGPVKRKAASVVWWPQALRHLS 
LLVHCWEESYEMNIGCQSLWAGGLASSGNGWDLGVAFRRDTCMSSSSLHW 
KEFKYAPGSLHYFALSFVLILTEICLVSSGMGFPQEGKHFSVLGSPDCSLWGR 
DEHVPREFA (SEQ ID NO:260), 
WRTQGPMVLLWVVTCPATMLTEPQNPHLIGFVAYSGPSHTTQ (SEQ ID 
NO:261), PHKYWLLLDGQADPAAAEGPVKRKAASVVWWPQALRHLSLL 
(SEQ ID NO:262), VHCWEESYEMNIGCQSLWAGGLASSGNGWDLGVAFRRDTCM 
(SEQ ID NO:263), 
SSSSLHWKEFKYAPGSLHYFALSFVLILTEICLVSSGMGFPQEG (SEQ ID 
NO:264), and/or KHFSVLGSPDCSLWGRDEHVPREFA (SEQ ID NO:265). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

The gene encoding the disclosed cDNA is thought to reside on chromosome 1. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1. 

It has been discovered that this gene is expressed primarily in tonsils, 
osteoclastoma and teratocarcinoma tissues, and to a lesser extent in female bladder, 
adipose tissue, myeloid progenitor, prostate tissue, and a number of other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases related to tonsils and 
osteoclasts. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and bone system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in tonsils and osteoclastoma tissues suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of diseases 
related to tonsils and osteoclasts. Examples include tonsillitis, adenoids, peritonsillar 
abscess, neoplasms, abnormal growth and modelling of the bone, osteonecrosis, 
osteoporosis, osteodystrophy, and osteoclastoma. Expression of this gene product in 
osteoclastoma suggests that it may play a role in the survival, proliferation, and/or 
growth of osteoclasts. Therefore, it may be useful in influencing bone mass in such 
conditions as osteoporosis. 

Moreover, the expression of this gene product in tonsils suggests a role in the 
regulation of the proliferation, survival, differentiation, and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as antibodies directed against the protein, may show utility as a tumor marker 
and/or immunotherapy target for the above listed tissues. Therefore, it may also be 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. The protein, as well as antibodies directed against the protein, may show utility as a 
tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:52 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1077 of SEQ ID NO:52, b 
is an integer of 15 to 1091, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:52, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

The translation product of this gene shares sequence homology with the 
Drosophila gene "maleless", which is one of four known regulatory loci required for 
increased transcription (dosage compensation) of X-linked genes (See Genbank 
Accession No.: gi|157906). 

It has been discovered that this gene is expressed primarily in normal prostate 
tissue, testes tissue, whole 6-week old embryonic tissue, human colon carcinoma 
(HCC) cell line, and cerebellum tissue, and to a lesser extent in primary breast cancer, 
activated T-cells, and many other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases of the prostate or colon, 
or male reproductive disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the prostate or colon carcinoma, and male reproductive 
disorders, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., colon, prostate, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
150 as residues: Val-39 to Ala-45. 

The tissue distribution in colon and prostate tissues suggests that the protein 
product of this clone is useful for the diagnosis and/or treatment of prostate disorders 
such as prostatitis, prostatic hyperplasia, prostate cancers, or human colon carcinoma, 
as well as cancers of other tissues where expression has been observed. Alternatively, 
the tissue distribution in testes tissue, in conjunction with the homology to the 
Drosophila maleless gene, suggests that the translation product of this gene is useful 
for the detection and/or treatment of disorders involving the testes or the transcription 
of X-linked genes. Furthermore, the tissue distribution indicates that the protein 
product of this clone is useful for the treatment and diagnosis of conditions 
concerning proper testicular function (e.g. endocrine function, sperm maturation), as 
well as cancer. Therefore, this gene product is useful in the treatment of male 
infertility and/or impotence. This gene product is also useful in assays designed to 
identify binding agents, as such agents (antagonists) are useful as male contraceptive 
agents. Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. 
The testes are also a site of active gene expression of transcripts that may be 
expressed, particularly at low levels, in other tissues of the body. Therefore, this gene 
product may be expressed in other specific tissues or organs where it may play related 
functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. The protein, 
as well as antibodies directed against the protein, may show utility as a tumor marker 
and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:53 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2240 of SEQ ID NO:53, b 
is an integer of 15 to 2254, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:53, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares weak sequence homology with 
Eimeria antigen Eam45 M3, which is thought to be important for use as a vaccine for 
protecting chickens against coccidiosis. 

It has been discovered that this gene is expressed primarily in adrenal gland 
tissue, and to a lesser extent in activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: adrenal cortical insufficiency, 
adrenal cortical hyperfunction, neoplasia. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., endocrine, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

The tissue distribution in adrenal gland tissue suggests that the protein product 
of this clone is useful for the diagnosis and/or intervention of disorders caused by 
adrenal gland abnormalities, such as adrenal cortical insufficiency, adrenal cortical 
hyperfunction, and neoplasia. More generally, the tissue distribution suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention 
of various endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancreas (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism), hypothalamus, and 
testes. The protein, as well as antibodies directed against the protein, may show utility as a 
tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:54 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 472 of SEQ ID NO:54, b 
is an integer of 15 to 486, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:54, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with neural 
thread protein, tumor necrosis factor related gene product, and human alpha-1C2 
adrenalin receptor, which are thought to be important for diagnosing the presence of 
Alzheimer's disease, neuroectodermal tumours and malignant astrocytoma, or for 
diagnosis of hepatocellular carcinomas and preneoplastic or pathological conditions 
of the liver, and tumor immunity. 

It has been discovered that this gene is expressed primarily in activated T-cells 
and endothelial cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: Alzheimer's disease, 
neuroectodermal tumours and malignant astrocytoma, hepatocellular carcinomas 
and tumors of various origins. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and endothelial cells, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, endothelial, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
152 as residues: Arg-38 to Arg-47. 

The tissue distribution in immune and endothelial tissues, and the homology to 
neural thread protein, tumor necrosis factor related gene product, human alpha-1C2 
adrenalin receptor, or Smaller hepatocellular oncoprotein (hhcm) gene product 
suggests that the protein product of this clone is useful for the diagnosis and/or 
treatment of tumors of various origins, including neuroectodermal tumours and 
malignant astrocytoma, hepatocellular carcinomas, as well as syndromes inflicted by 
these cancers. The protein, as well as antibodies directed against the protein, may show 
utility as a tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:55 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1256 of SEQ ID NO:55, b 
is an integer of 15 to 1270, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:55, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

It has been discovered that this gene is expressed primarily in tumor tissues 
such as hepatocellular tumor, hemangiopericytoma, chronic lymphocytic leukemia, 
and activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tumors of various origins. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
hepatocellular tumor, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., liver, immune, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

The tissue distribution in hepatocellular tumors suggests that the protein 
product of this clone is useful for the diagnosis and/or targeting of hepatocellular 
carcinomas, preneoplastic or pathological conditions of the liver, Alzheimer's disease, 
neuroectodermal tumours and malignant astrocytoma. The protein, as well as antibodies 
directed against the protein, may show utility as a tumor marker and/or 
immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:56 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 2045 of SEQ ID NO:56, b 
is an integer of 15 to 2059, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:56, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

It has been discovered that this gene is expressed primarily in glioblastoma, 
ulcerative colitis, and hemangiopericytoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: glioblastoma, 
hemangiopericytoma and their inflicted disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

of the above tissues or cells, particularly of the brain tissues, expression of this gene
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
154 as residues: Pro-31 to Ala-37.

The tissue distribution suggests that the protein product of this clone would be 
useful for the diagnosis, targeting and/or treatment of tumors in the brain, such as 
glioblastoma and hemangiopericytoma. Additionally, the gene products can be useful
agents for the diagnosis and treatment of ulcerative colitis. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are

related to SEQ ID NO:57 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 854 of SEQ ID NO:57, b 
is an integer of 15 to 868, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:57, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

It has been discovered that this gene is expressed primarily in bone marrow. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: immunodeficiency, tumor
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias
(leukemia) and other hematopoietic disorders. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be detected in certain tissues or cell types
(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
155 as residues: Thr-47 to Val-53.

The tissue distribution in bone marrow suggests that the protein product of this 
clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS),
immunosuppressive conditions (transplantation) and hematopoietic disorders. In addition this
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. Furthermore, the tissue distribution in bone marrow suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 

thrombocytopenia or leukemia.

The uses include bone marrow cell ex vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 

immunodeficiency etc. In addition, this gene product may have commercial utility in
the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:58 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 972 of SEQ ID NO:58, b
is an integer of 15 to 986, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:58, and where b is greater than or equal to a
+ 14.


FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

It has been discovered that this gene is expressed primarily in bone marrow. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: immunodeficiency, tumor
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias
(leukemia) and other hematopoietic disorders. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

the above tissues or cells, particularly of the immune system, expression of this gene
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
156 as residues: Leu-40 to Cys-47. 

The bone marrow tissue distribution suggests that the protein product of this 
clone would be useful for the diagnosis and treatment of immune disorders including: 

leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS),
immunosuppressive conditions (transplantation) and hematopoietic disorders. In addition this
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. Furthermore, the tissue distribution in bone marrow suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 

hematopoietic related disorders such as anemia, pancytopenia, leukopenia,
thrombocytopenia or leukemia.

The uses include bone marrow cell ex vivo culture, bone marrow
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in
the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:59 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 681 of SEQ ID NO:59, b 
is an integer of 15 to 695, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:59, and where b is greater than or equal to a 

+ 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: IAQGTVPLTKRGVQSSGPDYPEGTLTPLPRG (SEQ ID 
NO:266 and 267). Polynucleotides encoding these polypeptides are also encompassed
by the invention. 

It has been discovered that this gene is expressed primarily in dendritic cells. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune disorders and related
conditions such as leukemias, lymphomas, inflammation, hematopoietic dysfunction,
arthritis and asthma. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification 
of dendritic cells. For a number of disorders of the above tissues or cells, particularly 
of the immune system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., dendritic cells, cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
157 as residues: Ser-25 to Phe-31, Lys-55 to Arg-61.

The tissue distribution in dendritic cells suggests that the protein product of 
this clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS),
immunosuppressive conditions (transplantation) and hematopoietic disorders. In addition this
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. 

Moreover, the expression of this gene product in dendritic cells also strongly 
suggests a role for this protein in immune function and immune surveillance. Protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:60 and may have been publicly available prior to conception of 

the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 300 of SEQ ID NO:60, b 

is an integer of 15 to 314, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:60, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DCLYLALSFPWHCHCHHHPPSGSLLYPF (SEQ ID 
NO:268). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The translation product of this gene shares sequence homology with a C.
elegans protein of unknown function (See Genbank Accession No.: gi|1947142
(AF000264)).

It has been discovered that this gene is expressed primarily in healing 
abdominal wound tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tissue necrosis, wound healing, 
ulceration, neoplasms or cancer. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of injured tissue, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues or cell types (e.g., vascular, 
endothelial, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
158 as residues: Pro-34 to Tyr-43, Gln-73 to Cys-86, Pro-98 to Leu-103. 
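
Epitope spans of this form can be handled programmatically. As a rough sketch only (the helper names and the regular expression are illustrative assumptions, not part of the disclosure), the following parses spans written as "Xxx-N to Yyy-M" and reports the number of residues each covers.

```python
import re

# Matches spans such as "Pro-34 to Tyr-43" (three-letter residue code, position).
_SPAN = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitope_spans(text: str) -> list[tuple[str, int, str, int]]:
    """Extract (start_residue, start_pos, end_residue, end_pos) tuples."""
    return [(m[1], int(m[2]), m[3], int(m[4])) for m in _SPAN.finditer(text)]


def span_lengths(text: str) -> list[int]:
    """Length in residues of each span, counting both endpoints."""
    return [end - start + 1 for _, start, _, end in parse_epitope_spans(text)]


# The spans listed for SEQ ID NO. 158 in the text above.
SPANS_158 = "Pro-34 to Tyr-43, Gln-73 to Cys-86, Pro-98 to Leu-103"
```

Applied to `SPANS_158`, `span_lengths` gives the lengths of the three listed spans in order.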

The tissue distribution in healing abdominal wound tissue suggests that the 
protein product of this clone is useful for the treatment and/or diagnosis of conditions 
involving tissue repair and wound healing. Tissue repair may be indicated in cases of 
injury to the skin or internal organs, ulceration, cellular necrosis or other conditions 
involving healing of both diseased or non-diseased, traumatized tissue. In addition,
because of the implications of tissue regeneration, remodeling and growth regulation,
the protein product of this gene may have indications in the diagnosis and treatment
of neoplasms and cancer.

More generally, the tissue distribution in endothelial tissue indicates that the 
protein product of this gene is useful for the diagnosis and treatment of conditions and
pathologies of the cardiovascular system, such as heart disease, restenosis,
atherosclerosis, stroke, angina, thrombosis, and wound healing. Likewise, the tissue
distribution further suggests that the protein product of this clone is useful for the 
treatment, diagnosis, and/or prevention of various skin disorders including congenital 

disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell
carcinoma, squamous cell carcinoma, malignant melanoma, Paget's disease, mycosis
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e. wounds,
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, urticaria, eczema,
photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo,

dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may 
predispose increased susceptibility to viral and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, 

erysipelas, impetigo, tinea, athlete's foot, and ringworm). Moreover, the protein
product of this clone may also be useful for the treatment or diagnosis of various
connective tissue disorders such as arthritis, trauma, tendonitis, chondromalacia and
inflammation, autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, 
and dermatomyositis as well as dwarfism, spinal deformation, and specific joint 

abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia
congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:61 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 720 of SEQ ID NO:61, b 
is an integer of 15 to 734, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:61, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

The translation product of this gene shares sequence homology with FAR-17A,
which is an androgen induced protein, absent in castrated hamsters (See
Genbank Accession No.: gi|191315), as well as a male hormone-dependent gene
product (See GenSeq Accession No.: R10612). The gene encoding the disclosed

cDNA is thought to reside on chromosome 6. Accordingly, polynucleotides related to 
this invention are useful as a marker in linkage analysis for chromosome 6. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: ASLPPSRSRPLANMALVPCQVLRMAILLSYCSILCNYKA 

IEMPSHQTYGGSWKFLTFIDLVIQAVFFGICVLTDLSSLLTRGSGNQEQERQLK
KLISLRDW (SEQ ID NO:269). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fetal liver and 
spleen tissue, and to a lesser extent in a variety of other fetal tissues and brain tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune disorders including 
leukemias, lymphomas; reproductive and endocrine disorders, including testicular 
cancer; and liver disorders (e.g. hepatoblastoma, metabolic diseases and conditions 

that are attributable to the differentiation of hepatocyte progenitor cells). Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune and
reproductive systems, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., immune, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
159 as residues: Thr-59 to Gly-70, Tyr-132 to Glu-150.

The tissue distribution and homology to FAR-17A suggest that the protein
product of this clone is useful for the treatment and/or diagnosis of androgen related
conditions and disorders. Male reproductive and endocrine disorders would be
a potential area of application (e.g. endocrine function, sperm maturation). It may also
prove to be valuable in the diagnosis and treatment of testicular cancer.

More generally, the protein product of this clone may be useful for the 
treatment and/or diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 

also useful in assays designed to identify binding agents, as such agents (antagonists)
are useful as male contraceptive agents. Similarly, the protein is believed to be useful 
in the treatment and/or diagnosis of testicular cancer. The testes are also a site of 
active gene expression of transcripts that may be expressed, particularly at low levels, 
in other tissues of the body. Therefore, this gene product may be expressed in other 

specific tissues or organs where it may play related functional roles in other

processes, such as hematopoiesis, inflammation, bone formation, and kidney function, 
to name a few possible target indications. Protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target
for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:62 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1396 of SEQ ID NO:62, b 
is an integer of 15 to 1410, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:62, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-1 to calcium. 
Thus it is likely that the product of this gene is involved in a signal transduction 
pathway that is initiated when the product binds a receptor on the surface of the

plasma membrane of monocytes, and to a lesser extent, in immune or hematopoietic 
cells and tissues. Thus, polynucleotides and polypeptides have uses which include, 
but are not limited to, activating monocytes. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: MSRSSRISGLSCPWLL (SEQ ID NO:270). Polynucleotides
encoding these polypeptides are also encompassed by the invention. The gene
encoding the disclosed cDNA is believed to reside on chromosome 1. Accordingly,
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 1. 

It has been discovered that this gene is expressed primarily in T-cells.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases and/or disorders. Similarly, polypeptides and antibodies directed to those 

polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune and haemopoietic systems, expression of this gene at
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
160 as residues: Pro-42 to Cys-50, Leu-61 to Ala-66. 

The tissue distribution in T-cells, combined with the detected calcium flux 

activity in monocytes suggests that the protein product of this clone would be useful
for the treatment and diagnosis of immune and hematopoietic disorders. Moreover, the
expression of this gene product suggests a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product may be involved in the regulation of cytokine 

production, antigen presentation, or other processes suggesting a usefulness in the
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease,
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue
injury, demyelination, systemic lupus erythematosus, drug induced hemolytic anemia,
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues.

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 

expansion of stem cells and committed progenitors of various blood lineages, and in
the differentiation and/or proliferation of various cell types. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:63 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1217 of SEQ ID NO:63, b
is an integer of 15 to 1231, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:63, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DHWPAGFLPPAPGLKFPVALEVFRKVLPAVCPTDCSGS 
AGKERNS (SEQ ID NO:271). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in liver.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: metabolic diseases and liver 
conditions. Similarly, polypeptides and antibodies directed to those polypeptides are 

useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the metabolic system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., hepatic, liver, metabolic, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, bile, serum, plasma, 

urine, synovial fluid or spinal fluid) taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
161 as residues: Ser-31 to Gln-41.

The tissue distribution in liver suggests that the protein product of this clone
would be useful for treatment and diagnosis of disorders of the metabolic system and
liver disorders. Moreover, the protein product of this clone is useful for the detection
and treatment of liver disorders and cancers (e.g. hepatoblastoma, jaundice, hepatitis, 
liver metabolic diseases and conditions that are attributable to the differentiation of 

hepatocyte progenitor cells). Protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:64 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 598 of SEQ ID NO:64, b
is an integer of 15 to 612, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:64, and where b is greater than or equal to a 
+ 14. 
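
The a-b formula above can be illustrated with a minimal sketch. It assumes that positions are 1-based and that both endpoints are inclusive, which the text implies but does not state outright; the function names and the `min_len` parameter are hypothetical and are used here only for illustration. The length of 612 is taken from the upper bound given for b in SEQ ID NO:64.

```python
def is_excluded_fragment(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if positions a..b satisfy the a-b formula.

    Assumes 1-based, inclusive positions. For seq_len = 612 the
    largest permitted start position is 612 - 15 + 1 = 598.
    """
    max_a = seq_len - min_len + 1
    return (
        1 <= a <= max_a
        and min_len <= b <= seq_len
        and b >= a + min_len - 1  # b >= a + 14 when min_len is 15
    )


def count_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count every (a, b) pair permitted by the formula."""
    return sum(
        1
        for a in range(1, seq_len - min_len + 2)
        for b in range(a + min_len - 1, seq_len + 1)
    )


if __name__ == "__main__":
    seq_len = 612  # upper bound of b for SEQ ID NO:64
    print(is_excluded_fragment(1, 15, seq_len))    # shortest fragment at the 5' end
    print(is_excluded_fragment(598, 612, seq_len)) # shortest fragment at the 3' end
    print(count_fragments(seq_len))
```

Under these assumptions the formula covers every contiguous fragment of at least 15 nucleotides, which is why the bounds on a and b differ by exactly 14 throughout.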

FEATURES OF PROTEIN ENCODED BY GENE NO: 55

When tested against PC12 cell lines, supernatants removed from cells
containing this gene activated the EGR1 (early growth response gene 1) promoter
element. Thus, it is likely that this gene activates sensory neuron cells, and to a lesser
extent other neural cells and tissues, through the EGR1 signal transduction
pathway. EGR1 is a separate signal transduction pathway from Jak-STAT. Genes
containing the EGR1 promoter are induced in various tissues and cell types upon
activation, leading the cells to undergo differentiation and proliferation.

It has been discovered that this gene is expressed primarily in T-cells and 
monocytes, and to a lesser extent in cancerous tissues, including cancerous colon 
tissue and placenta.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and haemopoietic 
disorders and cancer such as colon cancer, but also such cancers as breast cancer,
cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, glioblastoma, lung
cancer, intestinal cancer, testicular cancer, stomach cancer, neuroblastoma, myxoma,
myoma, lymphoma, endothelioma, osteoblastoma, osteoclastoma, adenoma, and the 
like. Similarly, polypeptides and antibodies directed to those polypeptides are useful 
to provide immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
immune and haemopoietic systems, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
162 as residues: Glu-63 to Trp-72. 

The tissue distribution in T-cells and monocytes, combined with the detected
EGR1 biological activity, suggests that the protein product of this clone would be
useful for treatment and diagnosis of disorders of the immune and haemopoietic 
systems and colon and other cancers. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes suggesting 
a usefulness in the treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene
product may be involved in immune functions. Therefore it may also be used as an
agent for immunological disorders including arthritis, asthma, immunodeficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue
injury, demyelination, systemic lupus erythematosus, drug-induced hemolytic anemia,
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues.

Moreover, the protein may represent a secreted factor that influences the
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Expression within cellular
sources marked by proliferating cells suggests this protein may play a role in the
regulation of cellular division, and may show utility in the diagnosis and treatment of
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell
death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate
apoptosis or tissue differentiation and would be useful in the detection, treatment,
and/or prevention of degenerative or proliferative conditions and diseases. The protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:65 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2256 of SEQ ID NO:65, b
is an integer of 15 to 2270, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:65, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 56

The translation product of this gene has homology with several human keratin 
genes at the nucleotide level (see, for example, Troyanovsky, et al., Eur. J. Cell Biol. 
59:127-137 (1992) which is hereby incorporated by reference herein). Based on the 
sequence similarity, the translation product of this clone is expected to share
biological activities with keratin and growth factor proteins. Such activities are known
in the art, and some are described elsewhere herein.

It has been discovered that this gene is expressed primarily in neutrophils. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and haemopoietic
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and haemopoietic system, expression of this gene at significantly higher
or lower levels may be detected in certain tissues or cell types (e.g., cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

The tissue distribution in neutrophils suggests that the protein product of this
clone would be useful for treatment and diagnosis of disorders of the immune and
haemopoietic system. Furthermore, sequence homology of the polynucleotides and
polypeptides of the present invention with a number of human cytokeratin molecules,
such as CK-8, CK-15, and CK-17, indicates that molecules of the present invention
can be used diagnostically as markers of basal cell differentiation in complex epithelia,
and are therefore indicative of a certain type of epithelial stem cell, as well as markers of
the differentiation of other cell types such as neutrophils or other immune cells.
Molecules of the present invention, or agonists or antagonists thereof, can also be
used therapeutically to treat disorders of epithelial, neutrophil or other
immune cell differentiation or activation. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy targets
for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:66 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1269 of SEQ ID NO:66, b
is an integer of 15 to 1283, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:66, and where b is greater than or equal to a
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: EEIATSIEPIRDFLAIVFFASIGLHVFPTFVAYELTVLVF
LTLSVVV (SEQ ID NO:272). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in synovium, 
placenta, and stromal cells, and to a lesser extent in several other tissues and organs,
including, among others, bone marrow, palate, pituitary gland, and in tissue derived
from osteosarcoma and chondrosarcoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental disorders, as well
as disorders of the musculoskeletal and haematopoietic systems, and cancers 
including especially osteosarcoma and chondrosarcoma, but also other cancers 
including breast cancer, colon cancer, cardiac tumors, pancreatic cancer, melanoma, 
retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular cancer,
stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma,
osteoblastoma, osteoclastoma, adenoma, and the like. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the haemopoietic and musculoskeletal
systems, as well as developmental disorders, expression of this gene at significantly
higher or lower levels may be detected in certain tissues or cell types (e.g., synovium, 
placenta, stromal, immune, hematopoietic, skeletal, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
164 as residues: Pro-81 to Ser-88. 

The tissue distribution in placenta suggests that the protein product of this
clone would be useful for treatment and diagnosis of developmental disorders.
Polynucleotides and polypeptides of the present invention can be used diagnostically
and therapeutically to detect and treat many cancers, particularly osteosarcoma and 
chondrosarcoma. In addition, the expression of this gene product in synovium would 
suggest a role in the detection and treatment of disorders and conditions affecting the
skeletal system, in particular osteoporosis, bone cancer, as well as disorders afflicting
connective tissues (e.g. arthritis, trauma, tendonitis, chondromalacia and
inflammation), such as in the diagnosis or treatment of various autoimmune disorders
such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well as 
dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid).

Moreover, the protein is useful in the detection, treatment, and/or prevention
of a variety of vascular disorders and conditions, which include, but are not limited to,
microvascular disease, vascular leak syndrome, aneurysm, stroke, embolism,
thrombosis, coronary artery disease, arteriosclerosis, and/or atherosclerosis. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:67 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1249 of SEQ ID NO:67, b
is an integer of 15 to 1263, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:67, and where b is greater than or equal to a
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58

Contact of cells with supernatant expressing the product of this gene has been
shown to increase the permeability of the plasma membrane of renal mesangial cells
to calcium. Thus it is likely that the product of this gene is involved in a signal
transduction pathway that is initiated when the product binds a receptor on the surface
of the plasma membrane of renal and developing cells and tissues. Thus,
polynucleotides and polypeptides have uses which include, but are not limited to,
activating renal and developing cells and tissues.






In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: YCNLQCR (SEQ ID NO:273). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in the whole 
developing embryo, as well as in ovarian cancer and placenta.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental or reproductive 
diseases and/or disorders, in addition to ovarian cancer, as well as
other cancers including breast cancer, colon cancer, cardiac tumors, pancreatic cancer,
melanoma, retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular 
cancer, stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, 
osteoblastoma, osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to
provide immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the 
developing and fetal system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., developmental, 
reproductive, and cancerous and wounded tissues) or bodily fluids (e.g., lymph,
amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in embryonic and ovarian tissue, combined with the 
detected calcium flux activity, suggests that the protein product of this clone would be
useful for treatment and diagnosis of developmental disorders as well as ovarian and
other cancers. Expression within embryonic tissue and other cellular sources marked 
by proliferating cells suggests this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis and treatment of cancer and other 
proliferative disorders. Similarly, developmental tissues rely on decisions involving
cell differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis
can result in inappropriate suppression of cell death, as occurs in the development of
some cancers, or in failure to control the extent of cell death, as is believed to occur in
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in
addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. 
Alternatively, the protein is useful in the detection, treatment, and/or prevention of
vascular conditions, which include, but are not limited to, microvascular disease,
vascular leak syndrome, aneurysm, stroke, atherosclerosis, arteriosclerosis, or
embolism. The protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:68 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1603 of SEQ ID NO:68, b 
is an integer of 15 to 1617, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:68, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: SALIGNPKGCFGCFSPVVLREWSVESWKSLRPFQAICK
LKTNFR (SEQ ID NO:274). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in hypothalamus
and anergic T cells.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurological and inflammatory
defects, diseases, and/or disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous and immune systems, expression of
this gene at significantly higher or lower levels may be detected in certain tissues
(e.g., neural, immune, hematopoietic, and cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
166 as residues: His-33 to Trp-38.

The tissue distribution in hypothalamus and T-cells suggests that the protein 
product of this clone would be useful for study and treatment of immune and nervous 
system disorders. The protein product of this clone is useful for the detection,
treatment, and/or prevention of neurodegenerative disease states, behavioral
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction,
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated
expression of this gene product in regions of the brain suggests it plays a role
in normal neural function. Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. Moreover, the expression of this gene product suggests a
role in regulating the proliferation, survival, differentiation, and/or activation of
hematopoietic cell lineages, including blood stem cells. This gene product may be
involved in the regulation of cytokine production, antigen presentation, or other
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease,
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue
injury, demyelination, systemic lupus erythematosus, drug-induced hemolytic anemia,
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed
progenitors of various blood lineages, and in the differentiation and/or proliferation of
various cell types. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:69 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1375 of SEQ ID NO:69, b
is an integer of 15 to 1389, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:69, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60

The translation product of this gene shares nucleotide sequence homology 
with the human PKD1 gene which is thought to be important in polycystic kidney 
disease. 

This gene is expressed widely with a predominant expression exhibited in
liver, pediatric kidney, and in the whole 8-week-old developing human embryo.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, growth, renal, and 
metabolic defects, diseases, and/or disorders. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the endocrine, digestive and immune 
systems, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., renal, metabolic, hepatic, developmental,
and cancerous and wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, bile,
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

The tissue distribution in pediatric kidney suggests that the protein product of
this clone would be useful for study and treatment of renal and general neoplasias and
growth and development disorders. The protein product of this clone could be used in
the treatment and/or detection of kidney diseases including renal failure, nephritis,
renal tubular acidosis, proteinuria, pyuria, edema, pyelonephritis, hydronephritis,
nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and
kidney stones, in addition to Wilms' tumor, and congenital kidney
abnormalities such as horseshoe kidney, polycystic kidney, and Fanconi's syndrome.

Moreover, the expression within embryonic tissue suggests this protein may
play a role in the regulation of cellular division, and may show utility in the diagnosis 
and treatment of cancer and other proliferative disorders, particularly of the liver and 
other organs. Similarly, developmental tissues rely on decisions involving cell
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the
present invention are useful in treating, detecting, and/or preventing said disorders
and conditions, in addition to other types of degenerative conditions. Thus this protein
may modulate apoptosis or tissue differentiation and would be useful in the detection,
treatment, and/or prevention of degenerative or proliferative conditions and diseases.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:70 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1882 of SEQ ID NO:70, b 
is an integer of 15 to 1896, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:70, and where b is greater than or equal to a
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HEAALRGP (SEQ ID NO:275). Polynucleotides encoding
these polypeptides are also encompassed by the invention. 






It has been discovered that this gene is expressed primarily in human striatum 
depression. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: stroke, in addition to other
neurologically-related diseases and/or defects. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the central nervous system, expression of
this gene at significantly higher or lower levels may be detected in certain tissues
(e.g., neural, musculoskeletal, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
168 as residues: Glu-50 to Glu-61.

The tissue distribution in human striatum depression suggests that the protein
product of this clone would be useful for study and treatment of central nervous
system disorders, such as seizures and other neurological conditions. The protein product
of this clone is useful for the detection, treatment, and/or prevention of
neurodegenerative disease states, behavioral disorders, or inflammatory conditions
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease,
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia,
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder,
learning disabilities, ALS, psychoses, autism, and altered behaviors, including
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated
expression of this gene product in regions of the brain suggests it plays a role
in normal neural function. Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. Protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:71 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 294 of SEQ ID NO:71, b
is an integer of 15 to 308, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:71, and where b is greater than or equal to a
+ 14.
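
The a-b formula above defines a family of contiguous subsequences of SEQ ID NO:71. As a minimal illustrative sketch (not part of the disclosure), the Python helpers below check whether a pair of positions satisfies the stated constraints and count how many such pairs exist. The function and parameter names are hypothetical, and the code assumes that "between 1 to 294" is read as 1-based and inclusive.

```python
def fits_formula(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if (a, b) satisfies the a-b formula for a sequence of seq_len residues.

    Assumption: positions are 1-based and inclusive, so a runs over
    1..(seq_len - min_len + 1), b over min_len..seq_len, and b >= a + (min_len - 1).
    """
    max_a = seq_len - min_len + 1  # 294 for SEQ ID NO:71 (308 residues)
    return (1 <= a <= max_a
            and min_len <= b <= seq_len
            and b >= a + min_len - 1)


def count_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count every (a, b) pair allowed by the formula by direct enumeration."""
    return sum(
        1
        for a in range(1, seq_len + 1)
        for b in range(1, seq_len + 1)
        if fits_formula(a, b, seq_len, min_len)
    )


# SEQ ID NO:71: a is 1 to 294, b is 15 to 308, and b >= a + 14.
print(fits_formula(1, 15, 308))     # shortest fragment starting at position 1
print(fits_formula(294, 308, 308))  # shortest fragment ending at position 308
print(fits_formula(10, 20, 308))    # spans only 11 residues
print(count_fragments(308))         # number of distinct (a, b) pairs
```

Under this reading, every fragment covered by the formula spans at least 15 nucleotide positions, which is what the condition that b is greater than or equal to a + 14 expresses.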


FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This clone has homology to a cysteine-rich granulin peptide(s) from
leucocyte(s) which has been termed Granulin E. Granulins inhibit keratinocytes and are
useful topically for wound healing. The gene encoding the disclosed cDNA is
believed to reside on chromosome 3. Accordingly, polynucleotides related to this
invention are useful as a marker in linkage analysis for chromosome 3.

It has been discovered that this gene is expressed primarily in infant brain.
Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: neurological, developmental, and
growth defects. Similarly, polypeptides and antibodies directed to those polypeptides
are useful to provide immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the fetus and the nervous system, expression of this gene at
significantly higher or lower levels may be detected in certain tissues (e.g., neural,
developmental, growth, and cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from
an individual having such a disorder, relative to the standard gene expression level,
i.e., the expression level in healthy tissue from an individual not having the disorder.
Based on the strong conservation of cysteine residues, the polypeptide of the present
invention can be used to inhibit keratinocytes and promote wound healing.

The tissue distribution in infant brain suggests that the protein product of this
clone would be useful for study and treatment of nervous system, neurodegenerative
and developmental disorders. The protein product of this clone is useful for the
detection, treatment, and/or prevention of neurodegenerative disease states,
behavioral disorders, or inflammatory conditions which include, but are not limited to
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome,
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia,
trauma, congenital malformations, spinal cord injuries, ischemia and infarction,
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, elevated
expression of this gene product in regions of the brain suggests it plays a role
in normal neural function. Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. Protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues. The homology to granulin proteins suggests the protein product of this clone is
useful for the treatment, diagnosis, and/or prevention of various skin disorders
including congenital disorders (i.e. nevi, moles, freckles, Mongolian spots,
hemangiomas, port-wine syndrome), integumentary tumors (i.e. keratoses, Bowen's
disease, basal cell carcinoma, squamous cell carcinoma, malignant melanoma, Paget's
disease, mycosis fungoides, and Kaposi's sarcoma), injuries and inflammation of the
skin (i.e. wounds, rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis,
urticaria, eczema, photosensitivity, autoimmune disorders (i.e. lupus erythematosus,
vitiligo, dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus),
keloids, striae, erythema, petechiae, purpura, and xanthelasma. In addition, such
disorders may predispose to increased susceptibility to viral and bacterial infections of
the skin (i.e. cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster,
boils, cellulitis, erysipelas, impetigo, tinea, athlete's foot, and ringworm). Moreover,
the protein product of this clone may also be useful for the treatment or diagnosis of
various connective tissue disorders such as arthritis, trauma, tendonitis,
chondromalacia and inflammation, autoimmune disorders such as rheumatoid
arthritis, lupus, scleroderma, and dermatomyositis as well as dwarfism, spinal
deformation, and specific joint abnormalities as well as chondrodysplasias (i.e.
spondyloepiphyseal dysplasia congenita, familial osteoarthritis, Atelosteogenesis type
II, metaphyseal chondrodysplasia type Schmid). Protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:72 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1674 of SEQ ID NO:72, b
is an integer of 15 to 1688, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:72, and where b is greater than or equal to a
+ 14.


FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: SNAAGNVVRAFLYINHLKL GCKVGLA (SEQ ID NO:276). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
It has been discovered that this gene is expressed primarily in prostate cancer
and dendritic cells.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: reproductive, immune, and
hematopoietic diseases, defects and/or disorders. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the endocrine and immune systems,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., reproductive, immune, hematopoietic, and cancerous
and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, serum, plasma,
urine, synovial fluid or spinal fluid) taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 170
as residues: Trp-47 to Thr-54.

The tissue distribution in prostate cells and tissues indicates that the protein
products of this clone are useful for study, diagnosis and treatment of neoplasias, esp.
of the prostate, and hormonal and metabolic disorders. Moreover, the protein product
of this clone is useful for the treatment and diagnosis of hematopoietic related
disorders such as anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia
since stromal cells are important in the production of cells of hematopoietic lineages.
The uses include bone marrow cell ex vivo culture, bone marrow transplantation,
bone marrow reconstitution, radiotherapy or chemotherapy of neoplasia. The gene
product may also be involved in lymphopoiesis; therefore, it can be used in immune
disorders such as infection, inflammation, allergy, immunodeficiency etc. In addition,
this gene product may have commercial utility in the expansion of stem cells and
committed progenitors of various blood lineages, and in the differentiation and/or
proliferation of various cell types. Protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy targets for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:73 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1124 of SEQ ID NO:73, b
is an integer of 15 to 1138, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:73, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: NWAVLNMLLSKGKITIFLGPLECGS (SEQ ID NO:277).
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in B cell 
lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and hematopoietic
diseases, disorders, and/or defects, particularly cancers. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the hemopoietic and immune systems,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in B cell lymphoma suggests that the protein product of
this clone would be useful for study and treatment of blood and immune disorders and
neoplasias, esp. of the lymphatic system. The protein product of this clone is useful
for the treatment and diagnosis of hematopoietic related disorders such as anemia,
pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are
important in the production of cells of hematopoietic lineages. The uses include bone
marrow cell ex vivo culture, bone marrow transplantation, bone marrow
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also
be involved in lymphopoiesis; therefore, it can be used in immune disorders such as
infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product
may have commercial utility in the expansion of stem cells and committed
progenitors of various blood lineages, and in the differentiation and/or proliferation of
various cell types. Protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:74 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 763 of SEQ ID NO:74, b
is an integer of 15 to 777, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:74, and where b is greater than or equal to a
+ 14.






FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

It has been discovered that this gene is expressed primarily in B cell
lymphoma.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune and hematopoietic
diseases, disorders, and/or defects, particularly cancer. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the hemopoietic and immune systems,
expression of this gene at significantly higher or lower levels may be detected in
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in B cell lymphoma suggests that the protein product of
this clone would be useful for study and treatment of neoplasias, esp. of lymphatic
organs, and immune disorders. The protein product of this clone is useful for the
treatment and diagnosis of hematopoietic related disorders such as anemia,
pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are
important in the production of cells of hematopoietic lineages. The uses include bone
marrow cell ex vivo culture, bone marrow transplantation, bone marrow
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also
be involved in lymphopoiesis; therefore, it can be used in immune disorders such as
infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product
may have commercial utility in the expansion of stem cells and committed
progenitors of various blood lineages, and in the differentiation and/or proliferation of
various cell types. Protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:75 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1046 of SEQ ID NO:75, b
is an integer of 15 to 1060, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:75, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 66

The translation product of this gene shares sequence homology with a rat
protein phosphatase, in addition to a human heterogeneous nuclear ribonucleoprotein
R (See Genbank Accession No. gi|2697103 (AF000364)). When tested against PC12
cell lines, supernatants removed from cells containing this gene activated the EGR1
(early growth response gene 1) promoter element. Thus, it is likely that this gene
activates sensory neuron cells through the EGR1 signal transduction pathway. EGR1
is a separate signal transduction pathway from Jak-STAT. Genes containing the EGR1
promoter are induced in various tissues and cell types upon activation, leading the
cells to undergo differentiation and proliferation. This gene also showed activity in
sensory neurons using the EGR assay described in the Example section.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: PSHQTRKGKSAKLLDRPPEALRMKTtTTTLLLACHLQLEV
GVVVGGEVD (SEQ ID NO:278), FQASSANNQQNWGSQPIAQQPLQQGGDYSG
NYGYNNDNQEFYQDTYGQQWK (SEQ ID NO:279), WXPLLXTSGSPGLXGFG
TRMNGKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPP
PIRGRGRGGGRGGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYD
DGYAVRGRGGGRGGRGAPPPPRGRGAPPPRGRAGYSQRGAPLGPPRGSRGG
RGGPAQQQRGRGSRGSRGNRGGNVGGKRKADGYNQPDSKRRQPTTNRTGV
PNPSLSSRFSKVVTILVTMVTIMTTRNFIRILMGNSGSRQVRA (SEQ ID
NO:280), RMNGKEIEGEEIEIVLAKPPDKKRKER (SEQ ID NO:281), YYHPPP
RMPPPIRGRGRGGGRGGYG (SEQ ID NO:282), DYRGGYEDPYYGYDDGYAV
RGRGGGR (SEQ ID NO:283), PPPRGRAGYSQRGAPLGPPRGSRGGRGG (SEQ
ID NO:284), and/or ADGYNQPDSKRRQPTTNRTGVPNPSLSSRFSKVVT (SEQ
ID NO:285). Polynucleotides encoding these polypeptides are also encompassed by
the invention. The gene encoding the disclosed cDNA is believed to reside on
chromosome 1. Accordingly, polynucleotides related to this invention are useful as
a marker in linkage analysis for chromosome 1.

It has been discovered that this gene is expressed primarily in human primary 
breast cancer, lung, and leukocytes. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: reproductive, immune, or
pulmonary diseases and/or disorders, particularly breast cancer. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the reproductive,
immune and respiratory systems, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., reproductive,
immune, pulmonary, and cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue from an individual not having the disorder.

The tissue distribution in breast cancer cells and tissues, in addition to immune
cells, combined with the homology to a protein phosphatase suggests that the protein
product of this clone would be useful for diagnosis and treatment of breast cancer and
abnormalities of the lung and the immune system. Moreover, the expression of this
gene product suggests a role in regulating the proliferation; survival; differentiation;
and/or activation of hematopoietic cell lineages, including blood stem cells. This gene
product may be involved in the regulation of cytokine production, antigen
presentation, or other processes suggesting a usefulness in the treatment of cancer
(e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene
product may be involved in immune functions. Therefore it may be also used as an
agent for immunological disorders including arthritis, asthma, immunodeficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease,
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis,
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to
transplanted organs and tissues, such as host-versus-graft and graft-versus-host
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue
injury, demyelination, systemic lupus erythematosus, drug induced hemolytic anemia,
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues.

Moreover, the protein may represent a secreted factor that influences the
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to
sites of injury. In addition, this gene product may have commercial utility in the
expansion of stem cells and committed progenitors of various blood lineages, and in
the differentiation and/or proliferation of various cell types. The protein is useful in
modulating the immune response to aberrant cells and cell types, particularly
proliferative cells (e.g. protein may increase the immunogenicity of tumor antigens
either directly or indirectly, or may activate apoptosis). The protein is useful in
treating, detecting, and/or preventing various pulmonary disorders, which include, but
are not limited to, ARDS, emphysema, and cystic fibrosis. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:76 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1489 of SEQ ID NO:76, b
is an integer of 15 to 1503, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:76, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 
In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: LQIPPSSQSLGLKNADSSI (SEQ ID NO:286), GGPPESAPW
LPAVLRAPVLTSRCASSDSEGPVWFCQPGSGPSSTEMSCHCILGPGSSCLCVL
RGSMWTPSVPGWPQPAKETGASSCSVFSANNGSCPLPLHNHQRQASLDTGL
SLEHVPGESYFYSPVG (SEQ ID NO:287), SSDSEGPVWFCQPGSGPSSTEMSC
HCILGPGSSC (SEQ ID NO:288), WTPSVPGWPQPAKETGASSCSVFSANNG
(SEQ ID NO:289), and/or QRQASLDTGLSLEHVPGESYF (SEQ ID NO:290).
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human B cell 
lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune or hematopoietic diseases
and/or disorders, particularly B cell lymphoma. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system, expression of this
gene at significantly higher or lower levels may be detected in certain tissues or cell
types (e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an
individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue from an individual not having the disorder.

The tissue distribution in B-cell lymphoma suggests that the protein product of
this clone would be useful for diagnosis and treatment of immune or hematopoietic
diseases and/or disorders, particularly proliferative conditions. Moreover, the
expression of this gene product suggests a role in regulating the proliferation;
survival; differentiation; and/or activation of hematopoietic cell lineages, including
blood stem cells. This gene product may be involved in the regulation of cytokine
production, antigen presentation, or other processes suggesting a usefulness in the
treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene
product may be involved in immune functions. Therefore it may be also used as an
agent for immunological disorders including arthritis, asthma, immunodeficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease,
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis,
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to
transplanted organs and tissues, such as host-versus-graft and graft-versus-host
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue
injury, demyelination, systemic lupus erythematosus, drug induced hemolytic anemia,
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein
may represent a secreted factor that influences the differentiation or behavior of other
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene
product may have commercial utility in the expansion of stem cells and committed
progenitors of various blood lineages, and in the differentiation and/or proliferation of
various cell types. The uses include bone marrow cell ex vivo culture, bone marrow
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of
neoplasia. The gene product may also be involved in lymphopoiesis; therefore, it can
be used in immune disorders such as infection, inflammation, allergy,
immunodeficiency etc. In addition, this gene product may have commercial utility in
the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:77 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 858 of SEQ ID NO:77, b
is an integer of 15 to 872, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:77, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: SSSLVLTIRSQTLFLASFIHSTSIFCALN (SEQ ID NO:291).
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in osteoarthritic 
cartilage. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: osteoarthritis and other
bone/cartilage disorders, particularly degenerative conditions. Similarly, polypeptides
and antibodies directed to those polypeptides are useful to provide immunological
probes for differential identification of these tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the skeletal system, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., skeletal, joint, autoimmune, and cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue from an individual not having the
disorder.

The tissue distribution in osteoarthritic cartilage suggests that the protein
product of this clone would be useful for the diagnosis, treatment, and/or prevention
of osteoarthritis. Moreover, the gene product is useful in the detection and treatment
of disorders and conditions affecting the skeletal system, in particular osteoporosis
and bone cancer, as well as disorders afflicting connective tissues (e.g., arthritis, trauma,
tendonitis, chondromalacia and inflammation), such as in the diagnosis or treatment
of various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and
dermatomyositis as well as dwarfism, spinal deformation, and specific joint
abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia
congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal
chondrodysplasia type Schmid). The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:78 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 559 of SEQ ID NO:78, b
is an integer of 15 to 573, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:78, and where b is greater than or equal to a
+ 14.
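
The a-b formula defines a family of subsequences by their start and end positions. As an illustrative sketch only (it is not part of the original disclosure), the constraint can be checked and the number of qualifying fragments counted as shown below. The sketch assumes that "between 1 to 559" is inclusive, that positions are 1-based, and that the sequence length equals the upper bound on b (573 for SEQ ID NO:78). The function names are hypothetical.

```python
def fits_a_b_formula(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if the fragment spanning positions a..b (1-based, inclusive)
    satisfies the a-b formula: 1 <= a <= seq_len - 14, 15 <= b <= seq_len,
    and b >= a + 14 (i.e. the fragment is at least 15 residues long)."""
    max_a = seq_len - min_len + 1  # e.g. 573 - 15 + 1 = 559
    return (1 <= a <= max_a
            and min_len <= b <= seq_len
            and b >= a + min_len - 1)


def count_a_b_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count every (a, b) pair permitted by the formula."""
    max_a = seq_len - min_len + 1
    # For a given start a, b may be any position from a + 14 up to seq_len.
    return sum(seq_len - (a + min_len - 1) + 1 for a in range(1, max_a + 1))


if __name__ == "__main__":
    SEQ_LEN = 573  # assumed length, taken from the upper bound on b
    print(fits_a_b_formula(1, 15, SEQ_LEN))    # True: shortest fragment at the 5' end
    print(fits_a_b_formula(10, 20, SEQ_LEN))   # False: only 11 residues long
    print(count_a_b_fragments(SEQ_LEN))        # 156520
```

Under these assumptions the formula covers every contiguous fragment of at least 15 nucleotides, which is why the text describes listing each one individually as cumbersome.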

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
17. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 17. 

It has been discovered that this gene is expressed primarily in fetal brain,
pharynx carcinoma, and Hodgkin's lymphoma.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: developmental and/or proliferative
diseases and disorders, particularly pharynx carcinoma, and Hodgkin's lymphoma.
Similarly, polypeptides and antibodies directed to those polypeptides are useful to
provide immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
digestive and immune systems, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., developmental,
proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, amniotic fluid, synovial fluid or spinal fluid) taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:176
as residues: Tyr-30 to Ser-40.

The tissue distribution in pharynx carcinoma and Hodgkin's lymphoma
suggests that the protein product of this clone would be useful for diagnosis and
treatment of immune and proliferative conditions. Moreover, expression within fetal
tissue and other cellular sources marked by proliferating cells suggests this protein
may play a role in the regulation of cellular division, and may show utility in the
diagnosis and treatment of cancer and other proliferative disorders. Similarly,
developmental tissues rely on decisions involving cell differentiation and/or apoptosis
in pattern formation. Dysregulation of apoptosis can result in inappropriate
suppression of cell death, as occurs in the development of some cancers, or in failure
to control the extent of cell death, as is believed to occur in acquired
immunodeficiency and certain neurodegenerative disorders, such as spinal muscular
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present
invention are useful in treating, detecting, and/or preventing said disorders and
conditions, in addition to other types of degenerative conditions. Thus this protein
may modulate apoptosis or tissue differentiation and would be useful in the detection,
treatment, and/or prevention of degenerative or proliferative conditions and diseases.

Alternatively, the protein product of this clone is useful for the detection,
treatment, and/or prevention of neurodegenerative disease states, behavioral
disorders, or inflammatory conditions which include, but are not limited to,
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome,
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia,
trauma, congenital malformations, spinal cord injuries, ischemia and infarction,
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, elevated expression of this gene
product in regions of the brain suggests it plays a role in normal neural function.
Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:79 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1495 of SEQ ID NO:79, b 
is an integer of 15 to 1509, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:79, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

The translation product of this gene shares sequence homology with
insulin-like growth factor binding protein. Moreover, the protein has homology to the human
Slit-1 protein (See Genbank Accession No. gnl|PID|d1036170 (AB017167)), which is
thought to play an integral role in neural development. In Drosophila embryogenesis,
the slit gene has been shown to play a critical role in CNS midline formation. Each
Slit gene encodes a putative secreted protein, which contains conserved
protein-protein interaction domains including leucine-rich repeats (LRR) and epidermal
growth factor (EGF)-like motifs, like that of the Drosophila protein. The Slit genes
form an evolutionarily conserved group in vertebrates and invertebrates, and the
mammalian Slit proteins may participate in the formation and maintenance of the
nervous and endocrine systems by protein-protein interactions.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: the EGF-like domain: CCCRLGLSGPKC (SEQ ID NO:292); in
addition to the following: RAFWGLGALQLLDLSANQLEAL (SEQ ID NO:293),
HASGRRTGSADDGLQGRTGSGPPTAGAGGGGAAP (SEQ ID NO:294),
VSAAAGARLAPRAPGAPAGCRPMRGCAARAAARKSLVPVLPAGWRSGPAA
AARPGPRRLAHAPSAARSRAGPGAVARPLPRRHLAAAHGRGCGPAAARAGA
GSGPGARRAARVPTAGRPPGTHVHTSGQSGAPRDPEGEALADTWAQTGQGD
SSSNSSSSGRGRDQEGPRMGAAPPPPAPAVGGPLPVRPWSPSSAEPVLRPDAW
(SEQ ID NO:295),
TRPAAERAPRTTGSRDAQAAGLPPRVPGAGGLPPCGALPGR
GLGRCCCCCCCCRLGLSGPKCRPGPRPRGPWAPRTAPRCARACREACQLSAL
SLPAVPPGLSLRLRALLLDHNRVRALPPGAFAGAGALQRLDLRENGLHSVHV
RAFWGLGALQLLDLSANQLEALAPGTFAPLRALRNLSLAGNRLARLEPAALG
ALPLLRSLSLQDNELAALAPGLLGRLPALDALHLRGNPWGCGCALRPLCAWL
RRHPLPASEAETVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRDLARGLHA
RAGLLPRQPGFLPGAGLWAHRLPCAPPPPPHRRPPPAETVQTRTPIPTPTAVPR
PRTRGAPSAAAQA (SEQ ID NO:296),
GCRPMRGCAARAAARKSLVPVLPAGWRSGPAAAARPGPRRLAHAPSA (SEQ ID NO:297),
PGAVARPLPRRHLAAAHGRGCGPAAARAGA (SEQ ID NO:298),
SGQSGAPRDPEGEALADTWAQTGQ (SEQ ID NO:299),
PPAPAVGGPLPVRPWSPSSAEPV (SEQ ID NO:300),
APRTTGSRDAQAAGLPPRVPGAGGLP (SEQ ID NO:301),
GPRPRGPWAPRTAPRCARACRE (SEQ ID NO:302),
AVPPGLSLRLRALLLDHNRVRALPPGAFAGA (SEQ ID NO:303),
LGALQLLDLSANQLEALAPGTFAP (SEQ ID NO:304),
PPGAFAGAGALQRLDLRENGLHSVHVRAFWGLGALQ (SEQ ID NO:305),
RNLSLAGNRLARLEPAALGALPLLRSLS (SEQ ID NO:306),
LPALDALHLRGNPWGCGCALRPLCAW (SEQ ID NO:307),
TVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRD (SEQ ID NO:308),
LHARAGLLPRQPGFLPGAGLWAHR (SEQ ID NO:309),
and/or TVQTRTPIPTPTAVPRPRTRGAPS (SEQ ID NO:310). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in a breast cancer
cell line, MDA36.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: neural, reproductive, and
proliferative diseases and/or disorders, particularly breast cancer and degenerative
conditions. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the reproductive system, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., neural, reproductive, and
proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:177
as residues: Met-1 to Arg-10, Arg-64 to Ala-71, Gly-124 to Gly-131, Pro-189 to
Arg-194, Val-223 to Gly-228.

The tissue distribution in breast cancer cells and tissues and homology to
insulin-like growth factor binding protein suggest that the protein product of this
clone would be useful for diagnosis and treatment of breast cancer, and other forms of
cancer. Moreover, the homology to the conserved human Slit-1 protein suggests that
the protein is useful in the treatment, diagnosis, and/or prevention of neural disorders,
particularly developmental and degenerative conditions. Similarly, the protein is
useful for the treatment and/or diagnosis of neurodegenerative disease states,
behavioral disorders, or inflammatory conditions which include, but are not limited to,
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome,
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia,
trauma, congenital malformations, spinal cord injuries, ischemia and infarction,
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, elevated expression of this gene
product in regions of the brain suggests it plays a role in normal neural function.
Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. The protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy target for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:80 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1095 of SEQ ID NO:80, b
is an integer of 15 to 1109, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:80, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence:
HASGRPDRSSAPIGNSGLPCPDLEPLGGLQSKCRLCAPTEARGLWSRSLCSDRCDTWRS (SEQ ID NO:311),
and/or GLPCPDLEPLGGLQSKCRLCAPTEARGLW (SEQ ID NO:312). Polynucleotides encoding these
polypeptides are also encompassed by the invention. This gene also maps to
chromosome 1, and therefore can be used in linkage analysis as a marker for
chromosome 1.

It has been discovered that this gene is expressed primarily in salivary gland
and colon carcinoma.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: colon carcinoma and other
digestive system or gastrointestinal diseases and/or disorders. Similarly, polypeptides
and antibodies directed to those polypeptides are useful to provide immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, particularly of the digestive system, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., digestive system, gastrointestinal, metabolic, and cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, chyme, bile,
synovial fluid or spinal fluid) taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:178
as residues: Val-34 to Leu-39, Ser-64 to Cys-74, Ser-86 to Ser-95, Arg-128 to
Ala-136.

The tissue distribution in salivary gland and colon carcinoma suggests that the
protein product of this clone would be useful for the treatment and diagnosis of colon
cancer and other digestive system diseases and/or disorders, such as ulcers, and other
proliferative conditions. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:81 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 793 of SEQ ID NO:81, b
is an integer of 15 to 807, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:81, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: QEWESELGERRKPLQA (SEQ ID NO:313). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in 6 week old
human embryos.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: embryological defects; aberrant
development; aberrant cellular proliferation (e.g. cancers), and other developmentally
related or proliferative diseases and/or disorders. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the developing human embryo, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., developmental, and cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid)
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in 6 week old human embryos suggests that the protein
product of this clone would be useful for the diagnosis and/or treatment of defects in
embryonic development. Elevated expression of this gene product in early 6 week
human embryos suggests that this gene product plays a critical role in normal human
development. Alternatively, this gene product may be involved in the pattern of
cellular proliferation that accompanies early embryogenesis. Thus, aberrant
expression of this gene product in tissues - particularly adult tissues - may
correlate with patterns of abnormal cellular proliferation, such as found in various
cancers. Moreover, this protein may play a role in the regulation of cellular division,
and may show utility in the diagnosis and treatment of cancer and other proliferative
disorders. Similarly, developmental tissues rely on decisions involving cell
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can
result in inappropriate suppression of cell death, as occurs in the development of some
cancers, or in failure to control the extent of cell death, as is believed to occur in
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the
present invention are useful in treating, detecting, and/or preventing said disorders
and conditions, in addition to other types of degenerative conditions. Thus this protein
may modulate apoptosis or tissue differentiation and would be useful in the detection,
treatment, and/or prevention of degenerative or proliferative conditions and diseases.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:82 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1029 of SEQ ID NO:82, b
is an integer of 15 to 1043, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:82, and where b is greater than or equal to a
+ 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: CQSSNLIFFQFVNILFNLMMDILVDFSITKMPINSIFSLYFCYEII
(SEQ ID NO:314). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in 6 week old
human embryo.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: abnormal embryonic
development; abnormal cellular proliferation; developmental defects, and other
developmentally related or proliferative diseases and/or conditions. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the developing
human embryo, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., developmental, and cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, amniotic fluid, urine,
synovial fluid or spinal fluid) taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue from an individual not having the disorder.

The tissue distribution in 6 week old human embryo suggests that the protein
product of this clone would be useful for the diagnosis and treatment of disorders of
human embryonic development. Expression of this clone in developing embryos
suggests that it plays a critical role in early human development. Alternatively, it may
be involved in key cellular proliferation events that occur during embryogenesis.
Therefore, misexpression of this gene in adult tissues may lead to abnormal patterns of
cellular proliferation and cancer. Moreover, expression within embryonic tissue and
other cellular sources marked by proliferating cells suggests this protein may play a
role in the regulation of cellular division, and may show utility in the diagnosis and
treatment of cancer and other proliferative disorders. Similarly, developmental tissues
rely on decisions involving cell differentiation and/or apoptosis in pattern formation.
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as
occurs in the development of some cancers, or in failure to control the extent of cell
death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the
polynucleotides and polypeptides of the present invention are useful in treating,
detecting, and/or preventing said disorders and conditions, in addition to other types
of degenerative conditions. Thus this protein may modulate apoptosis or tissue
differentiation and would be useful in the detection, treatment, and/or prevention of
degenerative or proliferative conditions and diseases. The protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:83 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1159 of SEQ ID NO:83, b
is an integer of 15 to 1173, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:83, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: GPVWLFCFLTLCRKPSQLFSQENSCMDVAGGVTTCLPP
WFSRGAPAQMSQWPPSSDHGAVRAGRDSRVGPVQPSHLTCEGGKEEREKNK
KAEVNPPTGMGLANRIPRDDITLKLRNQGKLRTKENRTQSAKRHP (SEQ ID NO:315),
VACKPENRTKTHFASSPACDGHALGGQVGFAICFLSCLFPPM (SEQ ID NO:316),
and/or SHPMPNTPQKQLLFSEDNELLVSLRTGRKPTLQAALRVTG (SEQ ID NO:317).
Polynucleotides encoding these polypeptides are also
encompassed by the invention.

It has been discovered that this gene is expressed primarily in pleural cancer
and endometrial tumors, and, to a lesser extent, in bone marrow and apoptotic T cells.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: pleural cancer; endometrial
tumors; hematopoietic disorders; immune dysfunction. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the lungs and immune system, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., immune, hematopoietic, reproductive, and cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal
fluid) taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in pleural cancer and endometrial tumors indicates that
the protein products of this clone are useful for the diagnosis and treatment of various
reproductive cancers, including pleural cancer and endometrial tumors. In addition,
expression of this gene product within T cells and bone marrow suggests that it
may play a role in normal hematopoiesis. Therefore, this gene product may also be
useful in the diagnosis and/or treatment of a variety of hematopoietic disorders,
including defects in immune surveillance, inflammation, impaired immune function,
and T cell lymphomas. Use of this gene product may be appropriate in situations
designed to affect the proliferation, survival, and/or differentiation of various
hematopoietic cell lineages, including blood stem cells.

Moreover, this protein may play a role in the regulation of cellular division,
and may show utility in the diagnosis and treatment of cancer and other proliferative
disorders. Similarly, developmental tissues rely on decisions involving cell
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can
result in inappropriate suppression of cell death, as occurs in the development of some
cancers, or in failure to control the extent of cell death, as is believed to occur in
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the
present invention are useful in treating, detecting, and/or preventing said disorders
and conditions, in addition to other types of degenerative conditions. Thus this protein
may modulate apoptosis or tissue differentiation and would be useful in the detection,
treatment, and/or prevention of degenerative or proliferative conditions and diseases.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:84 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1547 of SEQ ID NO:84, b
is an integer of 15 to 1561, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:84, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares low sequence homology with
dreg-2, a gene product originally identified in Drosophila that shows an oscillating pattern
of expression tied into a circadian clock rhythm.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence:

AHRLQIRLLTWDVKDTLLRLRHPLGEAYATKARAHGLEV
EPSALEQGFRQAYRAQSHSFPNYGLSHGLTSRQWWLDVVLQTFHLAGVQDA
QAVAPIAEQLYKDFSHPCTWQVLDGAEDTLRECRTRGLRLAVISNFDRRLEGI
LXGLGLREHFDFVLTSEAAGWPKPDPRIFQEALRLAHMEPVVAAHVGDNYL
CDYQGPRAVGMHSFLVVGPQALDPVVRDSVPKEHILPSLAHLLPALDCLEGS
TPGL (SEQ ID NO:319),

EGDPRGRPRPRPLGPPPQLTLFTALXDILRQVRAPGLRLSRA
LEVGRKGSPIFKIQIYL (SEQ ID NO:318),
IRLLTWDVKDTLLRLRHPLGEAYATKA (SEQ ID NO:320),
LEQGFRQAYRAQSHSFPNYGLSHG (SEQ ID NO:321),
HLAGVQDAQAVAPIAEQLYKDFSHPC (SEQ ID NO:322),
VLDGAEDTLRECRTRGLRLAVIS (SEQ ID NO:323),
REHFDFVLTSEAAGWPKPDPRIFQEA (SEQ ID NO:324),
EPVVAAHVGDNYLCDYQGPRAVGMHSFL (SEQ ID NO:325),
and/or VVRDSVPKEHILPSLAHLLPALD (SEQ ID NO:326). Polynucleotides
encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in tumors of the
pancreas and thymus and to a lesser extent in a variety of fetal tissues, including fetal
brain, liver, spleen, and kidney.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: pancreatic cancer; thymic cancer;
disorders of fetal development; abnormal cellular proliferation; hematopoietic
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the pancreas and immune system, expression of this gene at significantly higher or
lower levels may be detected in certain tissues or cell types (e.g., developmental,
metabolic, immune, hematopoietic, and cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid)
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder.

The tissue distribution in proliferative and developmental cells and tissues 
indicates that the protein products of this clone are useful for the diagnosis and 
treatment of cancers, particularly pancreatic and thymic cancer. Expression of this
gene product within various fetal tissues also indicates that it is useful in the diagnosis
and/or treatment of human developmental disorders. Taken together, the observation
that this gene product is expressed in cancers and in fetal tissues indicates that it plays
a role in proliferation and/or differentiation events that are associated with early 
development. Misexpression of this gene product in adult tissues, therefore, may 
directly contribute to abnormal cellular proliferation and/or dedifferentiation that 
accompanies cancer.

Moreover, the expression of this gene product in fetal liver/spleen also 
suggests that it plays a role in hematopoiesis, and is useful in the diagnosis and/or 
treatment of a variety of disorders of the immune system. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:85 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1419 of SEQ ID NO:85, b
is an integer of 15 to 1433, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:85, and where b is greater than or equal to a
+ 14. 
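
As an illustrative aside (not part of the original disclosure), the a-b formula above can be read as a membership test on pairs of positions. The sketch below assumes 1-based, inclusive nucleotide positions and assumes that the upper bound on b (1433 for SEQ ID NO:85) equals the sequence length; the function names is_excluded_fragment and count_excluded_fragments are hypothetical.

```python
def is_excluded_fragment(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """Return True if positions a..b (1-based, inclusive) satisfy the a-b formula.

    Assumption: seq_len is the upper bound on b given in the text
    (e.g., 1433 for SEQ ID NO:85), and min_len is the minimum fragment
    length implied by b >= a + 14.
    """
    max_a = seq_len - (min_len - 1)  # e.g., 1433 - 14 = 1419
    return (
        1 <= a <= max_a
        and min_len <= b <= seq_len
        and b >= a + (min_len - 1)
    )


def count_excluded_fragments(seq_len: int, min_len: int = 15) -> int:
    """Count all (a, b) pairs covered by the formula for a given sequence length."""
    max_a = seq_len - (min_len - 1)
    # For each start a, b ranges from a + min_len - 1 up to seq_len.
    return sum(seq_len - (a + min_len - 1) + 1 for a in range(1, max_a + 1))


if __name__ == "__main__":
    # Bounds taken from the paragraph on SEQ ID NO:85.
    print(is_excluded_fragment(1, 15, 1433))
    print(is_excluded_fragment(1420, 1433, 1433))
    print(count_excluded_fragments(1433))
```

Under these assumptions, the bound on a (1419) follows from requiring b to be at least a + 14 while b does not exceed 1433, so every fragment covered by the formula is at least 15 nucleotides long.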



FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: IRKLGPGLAPCSCRSGQVFPRV (SEQ ID NO:327).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

It has been discovered that this gene is expressed primarily in frontal cortex, 
particularly derived from epileptic patients. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: epilepsy; neurodegenerative
diseases and disorders, particularly learning disorders. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the brain, CNS, and/or PNS, expression of
this gene at significantly higher or lower levels may be detected in certain tissues or
cell types (e.g., neural, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in frontal cortex tissue suggests that the protein product
of this clone would be useful for the diagnosis and/or treatment of disorders of the
brain and nervous system, particularly epilepsy. Moreover, the expression of this gene
product suggests that it may play a role in various critical processes of the nervous
system, including nerve survival, pathfinding, signal conductance, and/or synapse
formation. It may have effects on various processes including homeostasis, learning,
motor function, language, etc. Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy targets for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:86 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1363 of SEQ ID NO:86, b
is an integer of 15 to 1377, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:86, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 77

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

KPLRMARPGGPEHNEYALVSAWHSSGSYLDSEGLRHQDD
FDVSLLVCHCAAPFEEQGEAERHVLRLQFFVVLTSQRELFPRLTADMRRFRK
PPRLPPEPEAPGSSAGSPGEASGLILAPGPAPLFPPLAAEVGMARARLAQLVRL
AGGHCRRDTLWKRLFLLEPPGPDRLRLGGRLALAELEELLEAVHAKSIGDIDP
QLDCFLSMTVSWYQSLIKVLLSRFPRAVAISKAQTWELSTWLR (SEQ ID
NO:328), ARGTLELPTPLIAAHQLYNYVADHASSYHM (SEQ ID NO:329),
SHCEWPGQGAQNTTSMPWCRHGTVLAPTWTLRDFDTR (SEQ ID NO:330),
PLTTVSHLCPL
SLRVFTSHLDITAGHSHRDDTWVPIPALPLKHLRPPSSPFALGPWVSHPLMRW
VQKLSHLHSNPGTGFSMGGKSAEKLKC (SEQ ID NO:331), STAARGAPGPGR
AGGTPRSSPCQIHWGHRPPAGLLPIHDGLLVPEPDQSSPKPLPQSCRHFQSPDL
GTQYLVALNQKFTDCSALVFWTPLRKDVSEVVFREALPVQPQDTRSPPAQLV
STYHHLESVINTACFTLLDPPPLKGVDWTTECHCSLNHGPTRLPARGRTDQPF
WAPGQARH (SEQ ID NO:332),
HQRLCNYVLRVCCPSLAAGTALPKHPQPLTHPGL
QRVRSTPRTPWALLGYSFRPPW (SEQ ID NO:333),
PGGPEHNEYALVSAWHSSGSYLDSEGLR (SEQ ID NO:334),
DVSLLVCHCAAPFEEQGEAERHVLR (SEQ ID NO:335),
RLTADMRRFRKPPRLPPEPEAPGSSAGS (SEQ ID NO:336), GEASGLI
LAPGPAPLFPPLAAEVGM (SEQ ID NO:337),
TLWKRLFLLEPPGPDRLRLGGRL (SEQ ID NO:338), and/or
LAELEELLEAVHAKSIGDIDPQLDCFLS (SEQ ID NO:339). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fetal liver/spleen 
and leukocytes, and to a lesser extent in a colon adenocarcinoma cell line. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: hematopoietic disorders; immune
dysfunction; colon cancer; colorectal adenocarcinoma. Similarly, polypeptides and
antibodies directed to those polypeptides are useful to provide immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the immune system and colon, expression
of this gene at significantly higher or lower levels may be detected in certain tissues
or cell types (e.g., hematopoietic, immune, gastrointestinal, and cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or
spinal fluid) taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue from an individual
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
184 as residues: Leu-16 to Ser-23, Ser-38 to Pro-43, Gly-53 to Leu-60. 

The tissue distribution in colon adenocarcinoma suggests that the protein
product of this clone would be useful for the diagnosis and/or treatment of
gastrointestinal diseases and/or disorders, particularly proliferative conditions.
Expression of this gene product in fetal and proliferative cells and tissues suggests
that it may be a marker for cancers, and that its misregulated expression may in fact
contribute to the development or progression of the types of cancers dictated by its
expression.

Similarly, the expression of this gene product in fetal liver/spleen - a primary 
site of early hematopoiesis - taken together with its expression in peripheral blood 
leukocytes suggests that this gene product may play a role in a variety of 
hematopoietic processes, including the survival, proliferation, activation, and/or
differentiation of all blood cell lineages, including the totipotent hematopoietic stem
cell. Such a gene product may therefore play a role in a variety of hematopoietic
disorders including inflammation; immune dysfunction; defects in immune
surveillance; and hematopoietic cancers and lymphomas. Similarly, developmental
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern
formation. Dysregulation of apoptosis can result in inappropriate suppression of cell
death, as occurs in the development of some cancers, or in failure to control the extent
of cell death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:87 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1701 of SEQ ID NO:87, b 
is an integer of 15 to 1715, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:87, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
20. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 20. 

It has been discovered that this gene is expressed primarily in brain. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurodegenerative diseases and/or 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the central nervous system, expression of this gene at significantly higher or lower
levels may be detected in certain tissues or cell types (e.g., neural, and cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or
spinal fluid) taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue from an individual
not having the disorder. This gene is believed to reside on chromosome 20,
D20S111-D20S195. Polynucleotides corresponding to this gene are useful, therefore, as
chromosome markers.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
185 as residues: Met-1 to Tyr-6, Thr-38 to Ala-44.

The tissue distribution in brain tissue indicates that the protein products of this 
clone are useful for diagnosis and treatment of disorders of the central nervous 
system. Moreover, the protein product of this clone is useful for the detection,
treatment, and/or prevention of neurodegenerative disease states, behavioral
disorders, or inflammatory conditions which include, but are not limited to
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome,
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia,
trauma, congenital malformations, spinal cord injuries, ischemia and infarction,
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. 

In addition, elevated expression of this gene product in regions of the brain
suggests it plays a role in normal neural function. Potentially, this gene product is
involved in synapse formation, neurotransmission, learning, cognition, homeostasis,
or neuronal differentiation or survival. The protein, as well as antibodies directed against
the protein, may show utility as a tumor marker and/or immunotherapy targets for the
above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:88 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 403 of SEQ ID NO:88, b 
is an integer of 15 to 417, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:88, and where b is greater than or equal to a 
+ 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

When tested against U937 cell lines, supernatants removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid cells, and to a lesser extent,
other immune and hematopoietic cells or cell types, through the Jak-STAT signal
transduction pathway. GAS is a promoter element found upstream of many genes
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large
signal transduction pathway involved in the differentiation and proliferation of cells.
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS
element, can be used to indicate proteins involved in the proliferation and
differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: FQLYFNPELIFKHFQIWRLITNFLFFGPVGFNFLFNMIFLY
RYCRMLEEGSFRGRTADFVFMFLFGGFLMTLFGLFVSLVFLGQAFTIMLVYV
WSRXNPYVRMNFFGLLNFQAPFLPWVLMGFSLLLGNSIIVDLLGIAVGHIYFF
LEDVFPNQPGGIRILKTPSILKAIFDTPDEDPNYNPLPEERPGGFAWGEGQ (SEQ
ID NO:340),

GVGQATVGKMAYQSLRLEYLQIPPVSRAYTTACVLTTAAVQLELITPF
QLYFNPELIFKHFQIWRLITNFLFFGPVGFNFLFNMIFLYRYCRMLEEGSFRGR
TADFVF (SEQ ID NO:341), LIFKHFQIWRLITNFLFFGPVGF (SEQ ID NO:342),
FLYRYCRMLEEGSFRGRTADFVFMF (SEQ ID NO:343), LVFLGQAFTIMLVYV
WSRXNPYV (SEQ ID NO:344), VLMGFSLLLGNSIIVDLLGIA (SEQ ID NO:345),
NQPGGIRILKTPSILKAIFDTPDED (SEQ ID NO:346), RLEYLQIPPVSRAYTTAC
VLTTAAVQLE (SEQ ID NO:347), and/or RLITNFLFFGPVGFNFLFNMIFLYRYC
RMLE (SEQ ID NO:348). Polynucleotides encoding these polypeptides are also
encompassed by the invention. The gene encoding the disclosed cDNA is believed to
reside on chromosome 17. Accordingly, polynucleotides related to this invention are 
useful as a marker in linkage analysis for chromosome 17. 

It has been discovered that this gene is expressed primarily in smooth muscle, 
fetal brain, and fetal liver, and to a lesser extent in activated macrophages and colon cancer.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: developmental diseases,
immune-related diseases, neural disorders, and vascular diseases and conditions. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For
a number of disorders of the above tissues or cells, particularly of the immune system
and central nervous system, expression of this gene at significantly higher or lower
levels may be detected in certain tissues or cell types (e.g., developmental, vascular,
immune, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, amniotic fluid, urine, synovial fluid or spinal fluid) taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in fetal liver, macrophage, and fetal brain indicates that
the protein products of this clone are useful for the treatment and diagnosis of immune
system-related diseases and CNS diseases. Moreover, the protein product of this clone
is useful for the treatment and diagnosis of hematopoietic related disorders such as
anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells
are important in the production of cells of hematopoietic lineages. The uses include
bone marrow cell ex vivo culture, bone marrow transplantation, bone marrow
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also
be involved in lymphopoiesis; therefore, it can be used in immune disorders such as
infection, inflammation, allergy, immunodeficiency, etc. In addition, this gene product
may have commercial utility in the expansion of stem cells and committed
progenitors of various blood lineages, and in the differentiation and/or proliferation of
various cell types. Alternatively, the protein is useful in the detection, treatment,
and/or prevention of vascular conditions, which include, but are not limited to,
microvascular disease, vascular leak syndrome, aneurysm, stroke, atherosclerosis, 
arteriosclerosis, or embolism. 

Moreover, the expression within fetal tissue and other cellular sources marked
by proliferating cells, combined with the GAS biological activity, suggests this
protein may play a role in the regulation of cellular division, and may show utility in
the diagnosis and treatment of cancer and other proliferative disorders. Similarly,
developmental tissues rely on decisions involving cell differentiation and/or apoptosis
in pattern formation. Dysregulation of apoptosis can result in inappropriate
suppression of cell death, as occurs in the development of some cancers, or in failure
to control the extent of cell death, as is believed to occur in acquired
immunodeficiency and certain neurodegenerative disorders, such as spinal muscular
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present
invention are useful in treating, detecting, and/or preventing said disorders and
conditions, in addition to other types of degenerative conditions. Thus this protein
may modulate apoptosis or tissue differentiation and would be useful in the detection,
treatment, and/or prevention of degenerative or proliferative conditions and diseases.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:89 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1153 of SEQ ID NO:89, b
is an integer of 15 to 1167, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:89, and where b is greater than or equal to a
+ 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

The translation product of this gene shares sequence homology with 
proacrosin binding proteins (sp32) from non-human mammalian species. The binding
of sp32 to proacrosin may be involved in packaging the acrosin zymogen into the
acrosomal matrix. See, for example, J Biol Chem. 1994 Apr 1; 269(13):10133-10140,
incorporated herein by reference. Accordingly, the inventors have termed the
translation product of this gene human sp32 or "h-sp32". Contact of cells with
supernatant expressing the product of this gene has been shown to increase the
permeability of the plasma membrane of PMN to calcium. Thus it is likely that the
product of this gene is involved in a signal transduction pathway that is initiated when
the product binds a receptor on the surface of the plasma membrane of
neutrophils and, to a lesser extent, other immune and hematopoietic cells. Thus,
polynucleotides and polypeptides have uses which include, but are not limited to,
activating

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HASAGPDGSSPA (SEQ ID NO:349),
ELLLEKPKPWQPPAAAPHRALLVLCYSIVENTCHTPTAKAWKYMEEEILGFG
KSVCDSLGRRHMSTCALCDFCSLKLEQCHSEASLQRQQCDTSHKTPFAAPCL
PPRACPSATR (SEQ ID NO:350),
LPGWGFPTKICDTDYIQYPNYCSFKSQQCLMR
NRNRKVSRMRCLQNETYSALSPGKSEDVVLRWSQEFSTLTLGQFG (SEQ ID
NO:351), SPVLLPAFPPLPVPLLALPVSAPLPACVLVSAPACAPLLAPACAL
ALAPGFPGTRRIVGALPRCC (SEQ ID NO:352), LLVLCYSIVENTCHTPTAK
AWKYMEEEILGFGKS (SEQ ID NO:353), and/or LKLEQCHSEASLQRQQC
DTSHKTPFA (SEQ ID NO:354). Polynucleotides encoding these polypeptides are
also encompassed by the invention. The gene encoding the disclosed cDNA is
believed to reside on chromosome 12. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 12. 

It has been discovered that this gene is expressed primarily in testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive
system, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., reproductive, testis, prostate, epididymis,
and cancerous and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. This gene is 
believed to map to chromosome 12 and is thought to be useful as a chromosome 
marker. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
187 as residues: Asp-27 to Ser-32, Pro-52 to Thr-58, Arg-63 to Asn-70, Gln-78 to 
Gly-83, Thr-107 to Asn-113, Thr-160 to Val-176, Ser-188 to Gly-241, Leu-248 to
Pro-265, Tyr-302 to Gly-314. 

The tissue distribution in testis, combined with the specific homology to the 
sp32 protein indicates that the protein products of this clone are useful for the 
diagnosis, treatment, and/or prevention of reproductive diseases and/or disorders.
Moreover, polynucleotides and polypeptides corresponding to this gene are useful for 
the treatment and diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents.

Similarly, the protein is believed to be useful in the treatment and/or diagnosis
of testicular cancer. The testes are also a site of active gene expression of transcripts
that may be expressed, particularly at low levels, in other tissues of the body.
Therefore, this gene product may be expressed in other specific tissues or organs
where it may play related functional roles in other processes, such as hematopoiesis,
inflammation, bone formation, and kidney function, to name a few possible target
indications. The protein is useful in application and utility as a contraceptive, either
directly or indirectly. Based upon the detected calcium flux activity, the protein may
also be useful as an effective treatment for infertility (i.e. for inhibiting autoimmune
disorders). The protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:90 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 1878 of SEQ ID NO:90, b
is an integer of 15 to 1892, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:90, and where b is greater than or equal to a
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

The translation product of this contig has consistent sequence homology with
a number of previously described viral tat proteins (see, for example, Stevens, et al., J.
Virol. 64:3716-3725 (1990), which is hereby incorporated herein by reference).

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSL
(SEQ ID NO:355), QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSLP
GKWEV (SEQ ID NO:356), RASKTVPRMPPNWPAKMPCLCHIRTVEHLGTIS
SGAPGRPTGQQAARTYHICWIHPGQKIDSLPPSSQHPRSQQLAPGTWPSTSTT
KPAEETLGSSASLPISQARKSEKCTFQPSPWXVRGKESHQVPAHPSHRTETES
DHSPVRKPPSRGTRTGDFTVGDWSEAWLLELALL (SEQ ID NO:357), RMPPN
WPAKMPCLCHIRTVEHLG (SEQ ID NO:358), GRPTGQQAARTYHICWIHPG
QKIDS (SEQ ID NO:359), WPSTSTTKPAEETLGSSASLPISQA (SEQ ID NO:360),
KSEKCTFQPSPWXVRGKESHQVP (SEQ ID NO:361), and/or KPPSRGTRTGDF
TVGDWSEAWLLE (SEQ ID NO:362). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. 

It has been discovered that this gene is expressed almost exclusively in
neutrophils.

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of immune disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 

15 identification of the tissue(s) or cell type(s). For a number of disorders of the immune 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 

20 gene expression level, i.e., the expression level in healthy tissue from an individual 

not having the disorder. In addition, molecules of the present invention can be used to 
regulate transcription and translation of genes in cells of the immune system, as well 
as in other cell types. Such transcriptional and translation regulation is useful for 
diagnosing and treating a number of disorders in which an alterred state of 

25 transcription and translation may be a factor in the disorder. Such disorders include 
many viral infections, particularly of immune cells, including HIV-1, HIV-2, human 
T-cell lymphotropic virus (HTLV)-I, and HTLV-II, as well as other DNA and RNA 
viruses such as herpes simplex virus (HSV)-l, HSV-2, HSV-6, cytomegalovirus 
(CMV), Epstein-Barr virus (EBV), herpes samirii, adenoviruses, rhinoviruses, 

30 influenza viruses, reoviruses, and the like. In addition, the ability to use molecules of 
the present invention to molecularly regulate the processes of transcription and 
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translation is useful in the diagnosis and treatment of many types of cancers, 
particularly those of the immune system, including ovarian cancer, breast cancer, 
colon cancer, cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, 
glioblastoma, lung cancer, intestinal cancer, testicular cancer, stomach cancer, 
neuroblastoma, myxoma, myoma, lymphoma, endothelioma, osteoblastoma,
osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
188 as residues: Gln-2 to Trp-12, Ala-30 to Glu-35, Gln-42 to Ser-51. 

The tissue distribution in neutrophils, combined with the homology to viral tat
proteins, suggests that the protein product of this clone is useful for the diagnosis and
treatment of immune disorders, particularly viral infections and proliferative
disorders. Further, since this clone has a high degree of sequence relatedness to
factors which are involved in the regulation of transcription and translation, this clone
is useful as a regulator of such processes. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target
for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:91 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 and 509 of SEQ ID NO:91, b
is an integer of 15 to 523, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:91, and where b is greater than or equal to a
+ 14.
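
The a-b formula above can be read as a constraint on fragment start and end positions. The following is a minimal sketch of that reading, not part of the disclosure: the function names are hypothetical, positions are taken to be 1-based and inclusive, and the sequence length of 523 is an assumption inferred from the stated upper bound on b.

```python
def in_excluded_range(a: int, b: int, seq_len: int, min_len: int = 15) -> bool:
    """True if the 1-based, inclusive span a..b satisfies the a-b formula."""
    max_a = seq_len - min_len + 1  # 509 when seq_len is 523
    return 1 <= a <= max_a and min_len <= b <= seq_len and b >= a + min_len - 1


def count_excluded_spans(seq_len: int, min_len: int = 15) -> int:
    """Number of distinct (a, b) pairs permitted by the formula."""
    n = seq_len - min_len + 1
    return n * (n + 1) // 2


SEQ91_LEN = 523  # assumed length, taken from the upper bound on b

print(in_excluded_range(1, 15, SEQ91_LEN))     # shortest span at the first position
print(in_excluded_range(510, 523, SEQ91_LEN))  # a lies beyond the stated maximum of 509
print(count_excluded_spans(SEQ91_LEN))         # how many spans the formula covers
```

Under this reading, the condition "b is greater than or equal to a + 14" simply requires every described fragment to be at least 15 nucleotides long.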






FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

The translation product of this contig has clear sequence identity with a 
number of thioredoxins and endoplasmic reticulum resident proteins (see, for 
example, Shorrosh and Dixon, Plant J. 2:51-58 (1992), which is hereby incorporated
by reference herein).

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PCADCLSAWA (SEQ ID NO:363). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. The gene encoding the
disclosed cDNA is believed to reside on chromosome 5. Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 5. 

It has been discovered that this gene is expressed primarily in adipocytes and 
striatum depression, and in lower abundance in prostate, whole brain, fetal liver, and
spleen. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: prostate cancer, CNS diseases, and
immune disorders. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or lower
levels may be detected in certain tissues or cell types (e.g., neural, hematopoietic,
immune, and cancerous and wounded tissues) or bodily fluids (e.g., seminal fluid,
amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
Since the translation product of this clone has a high degree of sequence relatedness
to many thioredoxins, it can be used as a food additive to improve flour quality or to
suppress the anti-nutritional effects of leguminous plants. Molecules of the present
invention can further be used to inactivate toxins, for example, bee or snake venom.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
189 as residues: Trp-43 to Ala-49, Pro-68 to Ala-74, Glu-100 to Gly-111, Glu-120 to
Asn-125, Pro-141 to Ala-154, Asp-157 to Lys-171, Cys-177 to Ile-182, Ser-248 to
Leu-253, Thr-280 to Glu-285, Gly-353 to Val-359.
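
Epitope listings of this form follow a regular "residue-position to residue-position" pattern, so they can be turned into numeric ranges mechanically. The sketch below is illustrative only: the function name and regular expression are assumptions, and only the first three ranges listed for SEQ ID NO. 189 are used as input.

```python
import re

# Matches entries such as "Trp-43 to Ala-49" (three-letter residue code, position).
_EPITOPE = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text: str):
    """Return (start_residue, start_pos, end_residue, end_pos) for each range found."""
    return [
        (m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))
        for m in _EPITOPE.finditer(text)
    ]


ranges = parse_epitope_ranges("Trp-43 to Ala-49, Pro-68 to Ala-74, Glu-100 to Gly-111")
# Inclusive length of each epitope in residues.
lengths = [end - start + 1 for _, start, _, end in ranges]
print(ranges)
print(lengths)
```

The parsed positions could then be used to slice the corresponding polypeptide sequence, if that sequence is available from the sequence listing.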

The tissue distribution in whole brain suggests that the protein product of this
clone would be useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to, Alzheimer's Disease, Parkinson's Disease,
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated
expression of this gene product in regions of the brain suggests it plays a role
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. The secreted protein can also be used to determine
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or
receptors, to identify agents that modulate their interactions, and as nutritional 
supplements. It may also have a very wide range of biological activities. Typical of 
these are cytokine, cell proliferation/differentiation modulating activity or induction 
of other cytokines; immunostimulating/immunosuppressant activities (e.g. for treating
human immunodeficiency virus infection, cancer, autoimmune diseases and allergy);
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy);
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for
treating wounds); stimulation of follicle stimulating hormone (for control of fertility);
chemotactic and chemokinetic activities (e.g. for treating infections, tumors);
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. The protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:92 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 1368 of SEQ ID NO:92, b
is an integer of 15 to 1382, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:92, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

When tested against TF-1 cell lines, supernatants removed from cells
containing this gene activated the ISRE (interferon-sensitive responsive element)
promoter element. Thus, it is likely that this gene activates myeloid cells, and to a
lesser extent, immune and hematopoietic cells or tissues, through the JAK-STAT
signal transduction pathway. ISRE is a promoter element found upstream of many
genes which are involved in the JAK-STAT pathway. The JAK-STAT pathway is a
large signal transduction pathway involved in the differentiation and proliferation of
cells. Therefore, activation of the JAK-STAT pathway, reflected by the binding of the
ISRE element, can be used to indicate proteins involved in the proliferation and
differentiation of cells.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: HASGYLCIVLL (SEQ ID NO:364). Polynucleotides encoding
these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed exclusively in Rejected 
Kidney. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: kidney and other urinary tract 
disorders and disorders related to, or resulting from, transplantation. Similarly,
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
renal systems, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., renal, kidney, urogenital, immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression
level in healthy tissue from an individual not having the disorder. Molecules of the
present invention are particularly useful in the diagnosis and treatment of disorders 
related to transplantation, particularly kidney transplantation. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
190 as residues: Asn-49 to Gln-54, Glu-150 to Asp-159. 

The tissue distribution in rejected kidney tissue suggests that the protein
product of this clone would be useful for diagnosis and treatment of disorders related
to or resulting from rejection of transplanted organs, particularly the kidney.
Moreover, the protein product of this clone could be used in the treatment and/or
detection of kidney diseases including renal failure, nephritis, renal tubular acidosis,
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to
Wilms' Tumor Disease, and congenital kidney abnormalities such as horseshoe
kidney, polycystic kidney, and Fanconi's syndrome. Considering the tissue distribution
and detected ISRE biological activity, the protein is useful in modulating the immune
response to aberrant kidney proteins, including autoantigens and aberrant proteins
which are often present in degenerative and proliferative conditions. The protein, as well
as antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:93 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 1733 of SEQ ID NO:93, b
is an integer of 15 to 1747, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:93, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of this gene shares sequence homology with the
conserved MAL and plasmolipin proteins (Magyar et al., Gene 189:269-275 (1997);
see GenBank Accession No. gnl|PID|e183885), which are thought to be important in
modulating T cell function, and proper CNS function, respectively. When tested
against Jurkat cell lines, supernatants removed from cells containing this gene
activated the GAS (gamma activating sequence) promoter element. Thus, it is likely
that this gene activates myeloid cells, and to a lesser extent, immune or hematopoietic
cells and tissues, through the JAK-STAT signal transduction pathway. GAS is a
promoter element found upstream of many genes which are involved in the JAK-STAT
pathway. The JAK-STAT pathway is a large signal transduction pathway involved in
the differentiation and proliferation of cells. Therefore, activation of the JAK-STAT
pathway, reflected by the binding of the GAS element, can be used to indicate
proteins involved in the proliferation and differentiation of cells.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: NSARAARAEIVLGLLVWTLIAGTEYFRVPAFGWV (SEQ
ID NO:365). Polynucleotides encoding these polypeptides are also encompassed by
the invention. 

It has been discovered that this gene is expressed primarily in T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of immune, hematopoietic, and neural diseases and/or disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. Nucleic acids of the present invention are useful as probes for
detecting traumatic and pathological changes in the central and peripheral nervous
systems. Molecules of the present invention may be involved in regulating the growth
of Schwann cells and other neural cells. Molecules of the present invention are also
useful as modulators of the interaction between Schwann cells and other neural cells
and the extracellular matrix and are therefore useful for therapeutic intervention in
nerve damage, primarily by facilitating regeneration of damaged axons and
regenerating nerve cells in damaged nervous system tissues.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
191 as residues: Ser-58 to His-64. 

The tissue distribution in T-cells, combined with the homology to the MAL
and plasmolipin proteins and the detected GAS biological activity, suggests that the
protein product of this clone would be useful for the diagnosis and treatment of
immune disorders including, but not limited to, AIDS and other immunodeficiencies.
Moreover, the expression of this gene product suggests a role in regulating the
proliferation, survival, differentiation, and/or activation of hematopoietic cell
lineages, including blood stem cells. This gene product may be involved in the
regulation of cytokine production, antigen presentation, or other processes, suggesting
a usefulness in the treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene
product may be involved in immune functions. Therefore it may also be used as an
agent for immunological disorders including arthritis, asthma, leukemia, rheumatoid
arthritis, granulomatous disease, inflammatory bowel disease, sepsis, acne,
neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated
cytotoxicity; immune reactions to transplanted organs and tissues, such as
host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as
autoimmune infertility, lens tissue injury, demyelination, systemic lupus
erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's
disease, scleroderma and tissues. Moreover, the protein may represent a secreted
factor that influences the differentiation or behavior of other blood cells, or that
recruits hematopoietic cells to sites of injury. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of
various blood lineages, and in the differentiation and/or proliferation of various cell
types.

The secreted protein can also be used to determine biological activity, to raise
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents
that modulate their interactions, and as nutritional supplements. It may also have a
very wide range of biological activities. Typical of these are cytokine, cell
proliferation/differentiation modulating activity or induction of other cytokines;
immunostimulating/immunosuppressant activities (e.g. for treating human
immunodeficiency virus infection, cancer, autoimmune diseases and allergy);
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy);
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for
treating wounds); stimulation of follicle stimulating hormone (for control of fertility);
chemotactic and chemokinetic activities (e.g. for treating infections, tumors);
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for
regulation of metabolism, and behavior. Also contemplated is the use of the
corresponding nucleic acid in gene therapy procedures. The protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:94 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 586 of SEQ ID NO:94, b
is an integer of 15 to 600, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:94, and where b is greater than or equal to a
+ 14.


FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this clone has sequence identity to a protein 
tyrosine kinase reported by Oates and Wilks (The Worm Breeders Gazette 14:87-87 
(1995), which is hereby incorporated by reference herein). The gene encoding the 
disclosed cDNA is believed to reside on chromosome 2. Accordingly,
polynucleotides related to this invention are useful as a marker in linkage analysis for
chromosome 2. 

It has been discovered that this gene is expressed primarily in cerebellum, 
adult brain, retina, spinal cord, and kidney cortex. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: neural, visual, and renal diseases
and/or disorders. Similarly, polypeptides and antibodies directed to those
polypeptides are useful to provide immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the CNS, retina, and kidney cortex, expression of this gene at
significantly higher or lower levels may be detected in certain tissues or cell types
(e.g., neural, visual, renal, and cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue from an individual not having the disorder.

The tissue distribution in cerebellum, adult brain, and spinal cord tissue
suggests that the protein product of this clone would be useful for the diagnosis and 
treatment of neural diseases and disorders. The protein product of this clone is useful 
for the detection, treatment, and/or prevention of neurodegenerative disease states, 
5 behavioral disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive
compulsive disorder, depression, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, elevated
expression of this gene product in regions of the brain suggests it plays a role
in normal neural function. Potentially, this gene product is involved in synapse
formation, neurotransmission, learning, cognition, homeostasis, or neuronal
differentiation or survival. Moreover, the protein product of this clone could be used
in the treatment and/or detection of kidney diseases including renal failure, nephritis,
renal tubular acidosis, proteinuria, pyuria, edema, pyelonephritis, hydronephritis,
nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and
kidney stones, in addition to Wilms' Tumor Disease, and congenital kidney
abnormalities such as horseshoe kidney, polycystic kidney, and Fanconi's syndrome.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:95 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 572 of SEQ ID NO:95, b
is an integer of 15 to 586, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:95, and where b is greater than or equal to a
+ 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 86

The translation product of this clone has homology to trkB, and it is thought
that the protein of the present invention is a novel neural receptor
protein-tyrosine kinase, a trkB homolog (See for example, ). This protein is likely to be
derived from a gene for a ligand-regulated receptor closely related to the human trk
oncogene. Northern (RNA) analysis showed that the trkB gene is expressed
predominantly in the brain and that trkB expresses multiple mRNAs, ranging from 0.7
to 9 kb. Hybridization of cerebral mRNAs with a variety of probes indicates that there
are mRNAs encoding truncated trkB receptors.

In specific embodiments, polypeptides of the invention comprise the sequence
PCSPPDSPPLPGAFVWRVLWVC (SEQ ID NO:366). Polynucleotides encoding this
polypeptide are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in breast cancer, 
colon tumor, and B-cell lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: breast cancer, colon tumor, and B-cell
lymphoma. Similarly, polypeptides and antibodies directed to those polypeptides are
useful to provide immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may be
detected in certain tissues or cell types (e.g., neural, gastrointestinal, immune, and
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid or spinal fluid) taken from an individual having such a disorder,
relative to the standard gene expression level, i.e., the expression level in healthy
tissue from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO.
193 as residues: Ser-29 to Asn-40. 

The tissue distribution in proliferative cells and tissues suggests that the 
protein product of this clone would be useful for the treatment, detection, and/or 
prevention of cancer, particularly in the indicated tissues. The expression within 
cellular sources marked by proliferating cells suggests this protein may play a role in 
the regulation of cellular division, and may show utility in the diagnosis and treatment 
of cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. 

Alternatively, the homology to the trkB protein suggests the protein product of 
this clone is useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain suggests it plays a role in 
normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO:96 and may have been publicly available prior to conception of
the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 and 788 of SEQ ID NO:96, b
is an integer of 15 to 802, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO:96, and where b is greater than or equal to a
+ 14.


FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ARACFAYNGVCSEGRCWDSHFHGSV (SEQ ID NO:367), 
MSNMGKIPSLSLHIPINKYICSRIPKPIQKVNKSTVLQICLKRQIILNKNKM
SKIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID
NO:368),
MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCLLR
SKVMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ ID NO:369), MSDHS
KIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID
NO:370), MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCL
LRSK (SEQ ID NO:371), VMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ
ID NO:372). Polynucleotides encoding these polypeptides are also encompassed by 
the invention. 

It has been discovered that this gene is expressed primarily in dendritic cells
and smooth muscle.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immune, hematopoietic, and
vascular diseases and/or disorders. Similarly, polypeptides and antibodies directed to
those polypeptides are useful to provide immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at significantly
higher or lower levels may be detected in certain tissues (e.g., immune,
hematopoietic, smooth muscle vascular, and cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
194 as residues: Asp-40 to Ser-52. 

The tissue distribution in dendritic cells suggests that the protein product of 
this clone would be useful for immune disorders. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:97 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1212 of SEQ ID NO:97, b 
is an integer of 15 to 1226, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:97, and where b is greater than or equal to a 
+ 14. 
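
As an illustration only, the a-b formula above can be sketched in Python. The sketch checks whether a fragment with start position a and end position b falls within the recited range, and counts how many such fragments exist for SEQ ID NO:97. The function names and the min_len parameter are hypothetical and are introduced here for illustration; the bounds 1212 and 1226 are those recited above, and "between 1 to 1212" is assumed to be inclusive.

```python
def is_in_formula(a, b, max_a, max_b, min_len=15):
    """Return True if positions (a, b) satisfy the a-b formula.

    Assumes inclusive bounds: 1 <= a <= max_a, min_len <= b <= max_b,
    and b >= a + (min_len - 1), i.e. a fragment of at least min_len residues.
    """
    return 1 <= a <= max_a and min_len <= b <= max_b and b >= a + (min_len - 1)


def count_fragments(max_a, max_b, min_len=15):
    """Count all (a, b) pairs that satisfy the a-b formula."""
    total = 0
    for a in range(1, max_a + 1):
        # Smallest end position allowed for this start position.
        lowest_b = max(min_len, a + min_len - 1)
        if lowest_b <= max_b:
            total += max_b - lowest_b + 1
    return total


# Bounds recited for SEQ ID NO:97.
SEQ97_MAX_A = 1212
SEQ97_MAX_B = 1226

if __name__ == "__main__":
    print(is_in_formula(1, 15, SEQ97_MAX_A, SEQ97_MAX_B))
    print(count_fragments(SEQ97_MAX_A, SEQ97_MAX_B))
```

The same two functions apply to the other sequences in this section by substituting the bounds recited for each SEQ ID NO.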



FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

The translation product of this gene shares sequence homology with 
androgen-dependent expressed protein from golden hamster hair follicles, which is 
thought to be important in regulating the secretions from glands in the skin (see 
GenBank Accession No. gi|191315). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PTEGRQKVLKTFTVPRSALAMTKTSTCIYHFLVLSWYTF 
LNYYISQEGKDEVKPKILANGARWKY (SEQ ID NO:373), PTEGRQKVLKTF 
TVPRSALAMTKT (SEQ ID NO:375), PRSALAMTKTSTCIYHFLVLSWYTFLN 
YYISQEGK (SEQ ID NO:374), and/or FLNYYISQEGKDEVKPKILANGARWKY 
(SEQ ID NO:376). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in lung, colon 
cancer, and testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of secretory cells 
including cells in the lung, colon, testis and the skin. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the secretory epithelial cells in the lung, 
intestine, testis and skin, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
195 as residues: Val-21 to Asp-30, Pro-101 to Thr-109. 
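
The epitope ranges listed in this section follow a regular "Xaa-N to Yaa-M" pattern. As a minimal sketch, assuming that pattern holds, the ranges can be parsed and their lengths computed as follows. The function and constant names are hypothetical and introduced only for illustration; the example string is the range list recited above for SEQ ID NO. 195.

```python
import re

# Matches one epitope range such as "Val-21 to Asp-30":
# a three-letter residue code, a position, "to", and a second residue/position.
RANGE_PATTERN = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Return (start_residue, start_pos, end_residue, end_pos) for each range."""
    return [
        (m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))
        for m in RANGE_PATTERN.finditer(text)
    ]


def epitope_lengths(text):
    """Return the number of residues spanned by each range (inclusive)."""
    return [end - start + 1 for _, start, _, end in parse_epitope_ranges(text)]


# Range list recited above for SEQ ID NO. 195.
EPITOPES_SEQ_195 = "Val-21 to Asp-30, Pro-101 to Thr-109"

if __name__ == "__main__":
    for rng, length in zip(parse_epitope_ranges(EPITOPES_SEQ_195),
                           epitope_lengths(EPITOPES_SEQ_195)):
        print(rng, length)
```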

The tissue distribution and homology to androgen-regulated protein suggest 
that the protein product of this clone would be useful for treating disorders that 
involve highly secretory cells including those in the colon, testis, and skin. It may be 
useful for diagnosing disorders such as colon, lung, or testicular cancer and may be 
used to treat pulmonary conditions in patients with compromised respiratory function. 
In addition, the polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 
agents (antagonists) are useful as male contraceptive agents. 

Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that may be expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product may be expressed in other specific tissues or organs 
where it may play related functional roles in other processes, such as hematopoiesis, 
inflammation, bone formation, and kidney function, to name a few possible target 
indications. The protein, as well as antibodies directed against the protein, may show 
utility as a tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:98 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1106 of SEQ ID NO:98, b 
is an integer of 15 to 1120, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:98, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

The translation product of this gene shares sequence homology with DEC-205, a 
transmembrane protein which is thought to be important in antigen presentation in 
dendritic cells and T-cells. 

It has been discovered that this gene is expressed primarily in macrophages, 
dendritic cells, lung, and ulcerative colitis tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: inflammatory diseases such as 
ulcerative colitis. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues (e.g., cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
196 as residues: Asp-30 to Arg-36, Gln-59 to Val-65. 

The distribution in macrophages, dendritic cells, lung and ulcerative colitis 
tissues, and the homology to antigen presenting receptors, suggest that the protein 
product of this clone would be useful for modulating the immune response in both 
acute and chronic inflammatory conditions. The protein, as well as antibodies directed 
against the protein, may show utility as a tumor marker and/or immunotherapy target 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:99 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2582 of SEQ ID NO:99, b 
is an integer of 15 to 2596, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:99, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

This gene maps to chromosome 22 and therefore polynucleotides of the 
present invention can be used in linkage analysis as a marker for chromosome 22. 

In specific embodiments, polypeptides of the invention comprise the sequence 
FKDQLVYPLLAFT (SEQ ID NO:377) and/or RQALNLPDVFGLV (SEQ ID 
NO:379). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

It has been discovered that this gene is expressed primarily in fetal spleen and 
liver as well as CD34-positive cells and, to a lesser extent, in several tissues, 
suggesting a presence in blood or blood forming tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental defects in the 
blood and blood forming cells. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., fetal spleen and liver as well as CD34-positive cells, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
197 as residues: Gln-54 to Gly-61, Asn-79 to Leu-91, Glu-99 to Thr-105, Pro-120 to 
Gln-126, Pro-128 to Phe-134, Arg-150 to Arg-156, Arg-160 to Arg-170. 

The tissue distribution in fetal spleen and liver as well as CD34-positive cells 
suggests that the protein product of this clone would be useful for treating disorders in 
the development, proliferation, or regulation of blood forming cells including diseases 
such as lymphomas, granulomas, leukemias, and in the preservation and/or 
replenishment of stem cells in the blood. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:100 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1006 of SEQ ID NO:100, 
b is an integer of 15 to 1020, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:100, and where b is greater than or equal to 
a + 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ATASHDLLLF (SEQ ID NO:379), MSINICLMQSKTQGSCQ 
YLLLPHPVPIILKVSTVFSLLSLFRLLFLSFCPHPKKCSYLLKYYGPLEGHKTLX 
YLRTNLGVIQPPLRMYAAEDCNGIG (SEQ ID NO:380), MSINICLMQSKTQG 
SCQYLLLPHPVPIILKVSTVFSLLSLFRLLFL (SEQ ID NO:381), and/or 
SFCPHPKKCSYLLKYYGPLEGHKTLXYLRTNLGVIQPPLRMYAAEDCNGIG 
(SEQ ID NO:382). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
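
The shorter sequences recited above appear to be contiguous fragments of the longer one. As a hedged illustration, the following Python sketch locates a fragment within a longer sequence and checks whether SEQ ID NO:381 followed by SEQ ID NO:382 reproduces SEQ ID NO:380 as transcribed here. The function and variable names are hypothetical; the sequence strings are copied from the paragraph above with line breaks removed.

```python
# Sequences as recited above, with line breaks removed.
SEQ_380 = (
    "MSINICLMQSKTQGSCQ"
    "YLLLPHPVPIILKVSTVFSLLSLFRLLFLSFCPHPKKCSYLLKYYGPLEGHKTLX"
    "YLRTNLGVIQPPLRMYAAEDCNGIG"
)
SEQ_381 = (
    "MSINICLMQSKTQG"
    "SCQYLLLPHPVPIILKVSTVFSLLSLFRLLFL"
)
SEQ_382 = "SFCPHPKKCSYLLKYYGPLEGHKTLXYLRTNLGVIQPPLRMYAAEDCNGIG"


def locate_fragment(full, fragment):
    """Return the 1-based (start, end) of fragment within full, or None if absent."""
    index = full.find(fragment)
    if index < 0:
        return None
    return (index + 1, index + len(fragment))


if __name__ == "__main__":
    print(locate_fragment(SEQ_380, SEQ_381))
    print(locate_fragment(SEQ_380, SEQ_382))
    # True when the two fragments tile the longer sequence end to end.
    print(SEQ_381 + SEQ_382 == SEQ_380)
```

A check of this kind is also a quick way to catch transcription errors when sequences are copied from a scanned page.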

It has been discovered that this gene is expressed primarily in T cells, fetal 
heart and chronic lymphocytic leukemia and, to a lesser extent, in kidney, lung, and 
16-week embryos. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the blood including 
abnormalities in T cell function or blood cell proliferation such as leukemia. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., T cells, fetal heart and chronic 
lymphocytic leukemia, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
198 as residues: Leu-45 to Val-50. 

The tissue distribution in T cells, fetal heart and chronic lymphocytic leukemia 
suggests that the protein product of this clone would be useful for treating 
abnormalities of the blood, particularly those involving T-cells and the abnormal 
proliferation of blood cells such as lymphocytic leukemia. In addition, it suggests the 
protein product of this clone is useful for the diagnosis and treatment of a variety of 
immune system disorders. Moreover, the expression of this gene product suggests a 
role in regulating the proliferation, survival, differentiation, and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore, it may also be used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue 
injury, demyelination, systemic lupus erythematosus, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. 

The expression in fetal heart tissue would suggest a useful role for the protein 
product in developmental abnormalities, fetal deficiencies, pre-natal disorders and 
various wound-healing models and/or tissue trauma. The tissue distribution in kidney 
suggests the protein product of this clone could be used in the treatment and/or 
detection of kidney diseases including renal failure, nephritis, renal tubular acidosis, 
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilms' Tumor Disease, and congenital kidney abnormalities such as horseshoe 
kidney, polycystic kidney, and Fanconi's syndrome. 

In addition, the tissue distribution in embryonic tissue suggests the protein 
product of this clone is useful for the diagnosis, detection, and/or treatment of 
developmental disorders. Expression within embryonic tissue and other cellular 
sources marked by proliferating cells suggests this protein may play a role in the 
regulation of cellular division, and may show utility in the diagnosis and treatment of 
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein, as well as antibodies 
directed against the protein, may show utility as a tumor marker and/or 
immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:101 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1506 of SEQ ID NO:101, 
b is an integer of 15 to 1520, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:101, and where b is greater than or equal to 
a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with CTG4, 
a glutamine-repeat-containing gene thought to be a candidate genetic disease 
locus. 

In specific embodiments, polypeptides of the invention comprise the sequence 
KEEDDDTERLPSKCEVCKLLSTE (SEQ ID NO:383 and 384), LQAELSRTGRSR 
EVLELGQ (SEQ ID NO:385 and 386), RQAVIVCRRRFV (SEQ ID NO:387), 
PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCKARVSSSCLYPAPAPAPAPLEIDRCDSVPPVALCSAA 
YTLRICWASVLCHRPPPSTSQPKPRARPKKGKAIFPTAQVP (SEQ ID NO:388), 
PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCK (SEQ ID NO:389), and/or ARVSSSCLYPAPAPAPAPL 
EIDRCDSVPPVALCSAAYTLRICWASVLCHRPPPSTSQPKPRARPKKGKAIFPT 
AQVP (SEQ ID NO:390). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed in several tissues including 
lung, heart, kidney, adrenal gland, smooth muscle, cerebellum, and embryonic tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: inherited developmental disorders 
possibly with a neuropsychiatric component. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the nervous system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, 
urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
199 as residues: Lys-25 to Ser-36, Ser-53 to Glu-60, Thr-70 to Arg-75, Arg-111 to 
Thr-119, Glu-161 to Leu-189. 

The tissue distribution and homology to glutamine repeat family member 
CTG4 suggest that the protein product of this clone would be useful for identifying 
and treating specific diseases related to nucleotide triplet expansion. The tissue 
distribution in embryonic tissue suggests the protein product of this clone is useful for 
the diagnosis, detection, and/or treatment of developmental disorders. The relatively 
specific expression of this gene product during embryogenesis suggests it may be a 
key player in the proliferation, maintenance, and/or differentiation of various cell 
types during development. It may also act as a morphogen to control cell and tissue 
type specification. Because of potential roles in proliferation and differentiation, this 
gene product may have applications in the adult for tissue regeneration and the 
treatment of cancers. Expression within embryonic tissue and other cellular sources 
marked by proliferating cells suggests this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis and treatment of cancer and 
other proliferative disorders. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:102 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1292 of SEQ ID NO:102, 
b is an integer of 15 to 1306, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:102, and where b is greater than or equal to 
a + 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: EEKLFTSAPGRDFWVMGETRDGNEEN (SEQ ID NO:391). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 16. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 16. 

It has been discovered that this gene is expressed primarily in cancerous and 
fetal tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, developmental anomalies 
or fetal deficiencies. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system and developing fetus, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., developmental, reproductive, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
200 as residues: Met-1 to Ser-6. 

The tissue distribution in fetal tissue suggests that the protein product of this 
clone would be useful for the treatment and diagnosis of developmental anomalies or 
fetal deficiencies. In addition to fetal tissue, expression in a variety of cancerous 
tissues suggests a role in the treatment and diagnosis of uncontrolled cell proliferation 
and/or differentiation (e.g. cancer). Moreover, the expression within embryonic tissue 
and other cellular sources marked by proliferating cells suggests this protein may play 
a role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. 

Similarly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
The protein, as well as antibodies directed against the protein, may show utility as a 
tumor marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:103 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 771 of SEQ ID NO:103, b 
is an integer of 15 to 785, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:103, and where b is greater than or equal to 
a + 14. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 

This gene is expressed primarily in hypothalamus, T-cells, and adipose tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune (e.g. immunodeficiencies, 
autoimmunities, inflammation, leukemias & lymphomas) and neurological (e.g. 
Alzheimer's disease, dementia, schizophrenia) disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous, hematopoietic and 
immune systems, expression of this gene at significantly higher or lower levels may 
be detected in certain tissues (e.g., immune, neural, metabolic, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. The tissue distribution suggests that the protein product of this 
clone would be useful in the intervention or detection of pathologies associated with 
the hematopoietic and immune systems, such as anemias (leukemias). In addition, the 
expression in brain (including fetal) might suggest a role in developmental brain 
defects, neuro-degenerative diseases or behavioral abnormalities (e.g. schizophrenia, 
Alzheimer's, dementia, depression, etc.). 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
201 as residues: Phe-64 to Gly-77, Pro-83 to Asp-99. 

The tissue distribution in hypothalamus suggests the protein product of this 
clone is useful for the detection, treatment, and/or prevention of neurodegenerative 
disease states, behavioral disorders, or inflammatory conditions which include, but 
are not limited to Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, 
Tourette Syndrome, meningitis, encephalitis, demyelinating diseases, peripheral 
neuropathies, neoplasia, trauma, congenital malformations, spinal cord injuries, 
ischemia and infarction, aneurysms, hemorrhages, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, depression, panic disorder, learning 
disabilities, ALS, psychoses, autism, and altered behaviors, including disorders in 
feeding, sleep patterns, balance, and perception. In addition, elevated expression of 
this gene product in regions of the brain suggests it plays a role in normal neural 
function. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes suggesting a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore, it may also be used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lens tissue 
injury, demyelination, systemic lupus erythematosus, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to
sites of injury. In addition, this gene product may have commercial utility in the
expansion of stem cells and committed progenitors of various blood lineages, and in
the differentiation and/or proliferation of various cell types. Moreover, the protein
product of this clone is useful for the diagnosis, prevention, and/or treatment of
various metabolic disorders which include, but are not limited to, Tay-Sachs disease,
phenylketonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome.
The protein is useful in the treatment and/or prevention of neurodegenerative
conditions, particularly those which occur secondary to aberrant fatty acid
metabolism (i.e. defects which affect the synthesis and integrity of the myelin sheath).
The protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO: 104 and may have been publicly available prior to conception
of the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 2001 of SEQ ID NO: 104,
b is an integer of 15 to 2015, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO: 104, and where b is greater than or equal to
a + 14.
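
The a-b formula above can be read as a simple membership test on a pair of nucleotide positions. The following is a minimal sketch only, assuming that both ranges are inclusive and using the bounds stated for SEQ ID NO: 104; the function names are hypothetical and are not part of the disclosure.

```python
def fits_a_b_formula(a: int, b: int, max_a: int = 2001, max_b: int = 2015) -> bool:
    """Return True if positions a..b satisfy the a-b formula.

    Assumes inclusive bounds: 1 <= a <= max_a, 15 <= b <= max_b, b >= a + 14.
    The defaults are the values recited for SEQ ID NO: 104.
    """
    return 1 <= a <= max_a and 15 <= b <= max_b and b >= a + 14


def fragment_length(a: int, b: int) -> int:
    """Number of nucleotides spanned by positions a through b, inclusive."""
    return b - a + 1


# The constraint b >= a + 14 means every described fragment has at least 15 bases.
shortest = fragment_length(1, 15)
```

Under these assumptions the shortest fragment covered by the formula is 15 nucleotides long and the longest is the full 2015-nucleotide sequence.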


FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

The translation product of this gene was shown to have homology to the
murine leucine-rich repeat protein (See Genbank Accession No. gi|2880079), which is
thought to be important in neural development.

In specific embodiments, the polypeptides of the invention comprise the
sequence: QKPTFALGELYPPLINLWEAGKEKSTSLKVKATVIGLPTNMS (SEQ
ID NO: 392). Polynucleotides encoding this polypeptide are also encompassed by the
invention. The gene encoding the disclosed cDNA is believed to reside on
chromosome 7. Accordingly, polynucleotides related to this invention are useful as
a marker in linkage analysis for chromosome 7.

It has been discovered that this gene is expressed primarily in T-cells and
brain.

Therefore, nucleic acids of the invention are useful as reagents for differential
identification of the tissue(s) or cell type(s) present in a biological sample and for
diagnosis of the following diseases and conditions: immunodeficiency, tumor
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias
(leukemia) and other hematopoietic disorders, neurological diseases of the brain such
as depression, schizophrenia, Alzheimer's disease, Parkinson's disease, Huntington's
disease, dementia and specific brain tumors. Similarly, polypeptides and antibodies
directed to those polypeptides are useful to provide immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the brain and immune system, expression of
this gene at significantly higher or lower levels may be detected in certain tissues or
cell types (e.g., neural, immune, hematopoietic, and cancerous and wounded tissues)
or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid or
spinal fluid) taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue from an individual
not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
202 as residues: Met-24 to Gly-29, Ala-57 to Thr-63.

The tissue distribution in T-cells suggests that the protein product of this clone
would be useful for the diagnosis and treatment of immune disorders including:
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS),
immunosuppressive conditions (transplantation) and hematopoietic disorders. In
addition, this gene product may be applicable in conditions of general microbial infection,
inflammation or cancer. The expression in brain, combined with the homology to the
leucine-rich repeat protein, suggests that the protein product of this clone would be
useful for the treatment and diagnosis of developmental, degenerative and behavioral
conditions of the brain and nervous system, such as depression, schizophrenia,
Alzheimer's disease, Parkinson's disease, Huntington's disease, Tourette Syndrome,
mania, dementia, paranoia, addictive behavior, obsessive-compulsive disorder and
sleep disorders. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly
available and accessible through sequence databases. Some of these sequences are
related to SEQ ID NO: 105 and may have been publicly available prior to conception
of the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence
would be cumbersome. Accordingly, preferably excluded from the present invention
are one or more polynucleotides comprising a nucleotide sequence described by the
general formula of a-b, where a is any integer between 1 to 353 of SEQ ID NO: 105, b
is an integer of 15 to 367, where both a and b correspond to the positions of
nucleotide residues shown in SEQ ID NO: 105, and where b is greater than or equal to
a + 14.

Table 1 summarizes the information corresponding to each "Gene No." described
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled
from partially homologous ("overlapping") sequences obtained from the "cDNA
clone ID" identified in Table 1 and, in some cases, from additional related DNA
clones. The overlapping sequences were assembled into a single contiguous sequence
of high redundancy (usually three to five overlapping sequences at each nucleotide
position), resulting in a final sequence identified as SEQ ID NO:X.

The cDNA Clone ID was deposited on the date and given the corresponding
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits
contain multiple different clones corresponding to the same gene. "Vector" refers to
the type of vector contained in the cDNA Clone ID.

"Total NT Seq." refers to the total number of nucleotides in the contig 
identified by "Gene No." The deposited clone may contain all or most of these
sequences, reflected by the nucleotide position indicated as "5' NT of Clone Seq."
and the "3' NT of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID
NO:X of the putative start codon (methionine) is identified as "5' NT of Start Codon."
Similarly, the nucleotide position of SEQ ID NO:X of the predicted signal sequence
is identified as "5' NT of First AA of Signal Pep."

The translated amino acid sequence, beginning with the methionine, is
identified as "AA SEQ ID NO:Y," although other reading frames can also be easily
translated using known molecular biology techniques. The polypeptides produced by
these alternative open reading frames are specifically contemplated by the present
invention.

The first and last amino acid positions of SEQ ID NO:Y of the predicted signal
peptide are identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The
predicted first amino acid position of SEQ ID NO:Y of the secreted portion is
identified as "Predicted First AA of Secreted Portion." Finally, the amino acid
position of SEQ ID NO:Y of the last amino acid in the open reading frame is
identified as "Last AA of ORF."

SEQ ID NO:X and the translated SEQ ID NO:Y are sufficiently accurate and
otherwise suitable for a variety of uses well known in the art and described further
below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the
cDNA contained in the deposited clone. These probes will also hybridize to nucleic
acid molecules in biological samples, thereby enabling a variety of forensic and
diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID
NO:Y may be used to generate antibodies which bind specifically to the secreted
proteins encoded by the cDNA clones identified in Table 1.

Nevertheless, DNA sequences generated by sequencing reactions can contain
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or
deleted nucleotides cause frame shifts in the reading frames of the predicted amino
acid sequence. In these cases, the predicted amino acid sequence diverges from the
actual amino acid sequence, even though the generated DNA sequence may be greater
than 99.9% identical to the actual DNA sequence (for example, one base insertion or
deletion in an open reading frame of over 1000 bases).
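
The frame-shift effect described above can be illustrated with a short sketch. It assumes the standard genetic code and uses a made-up 1005-base open reading frame (not a sequence of the invention); all names are hypothetical, and the identity figure is a simple length ratio rather than the output of an alignment program.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Illustrative ORF: start codon, 333 alanine codons, stop codon (1005 bases).
orf = "ATG" + "GCT" * 333 + "TAA"
# Simulated sequencing error: one extra base inserted after position 300.
mutant = orf[:300] + "A" + orf[300:]

# One inserted base among more than 1000 leaves the DNA over 99.9% identical.
dna_identity = len(orf) / len(mutant)

protein_ref = translate(orf)
protein_mut = translate(mutant)
# Index (0-based) of the first residue at which the two translations differ.
first_divergence = next(
    i for i, (x, y) in enumerate(zip(protein_ref, protein_mut)) if x != y
)
```

In this toy case the two translations agree up to the insertion point and disagree at every residue after it, which is why the deposited clone, rather than the predicted sequence alone, is relied on where precision is required.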

Accordingly, for those applications requiring precision in the nucleotide
sequence or the amino acid sequence, the present invention provides not only the
generated nucleotide sequence identified as SEQ ID NO:X and the predicted
translated amino acid sequence identified as SEQ ID NO:Y, but also a sample of
plasmid DNA containing a human cDNA of the invention deposited with the ATCC,
as set forth in Table 1. The nucleotide sequence of each deposited clone can readily
be determined by sequencing the deposited clone in accordance with known methods.
The predicted amino acid sequence can then be verified from such deposits.
Moreover, the amino acid sequence of the protein encoded by a particular clone can
also be directly determined by peptide sequencing or by expressing the protein in a
suitable host cell containing the deposited human cDNA, collecting the protein, and
determining its sequence.

The present invention also relates to the genes corresponding to SEQ ID
NO:X, SEQ ID NO:Y, or the deposited clone. The corresponding gene can be
isolated in accordance with known methods using the sequence information disclosed
herein. Such methods include preparing probes or primers from the disclosed
sequence and identifying or amplifying the corresponding gene from appropriate
sources of genomic material.

Also provided in the present invention are species homologs. Species
homologs may be isolated and identified by making suitable probes or primers from
the sequences provided herein and screening a suitable nucleic acid source for the
desired homolog.

The polypeptides of the invention can be prepared in any suitable manner.
Such polypeptides include isolated naturally occurring polypeptides, recombinantly
produced polypeptides, synthetically produced polypeptides, or polypeptides
produced by a combination of these methods. Means for preparing such polypeptides
are well understood in the art.

The polypeptides may be in the form of the secreted protein, including the
mature form, or may be a part of a larger protein, such as a fusion protein (see below).
It is often advantageous to include an additional amino acid sequence which contains
secretory or leader sequences, pro-sequences, sequences which aid in purification,
such as multiple histidine residues, or an additional sequence for stability during
recombinant production.

The polypeptides of the present invention are preferably provided in an
isolated form, and preferably are substantially purified. A recombinantly produced
version of a polypeptide, including the secreted polypeptide, can be substantially
purified by the one-step method described in Smith and Johnson, Gene 67:31-40
(1988). Polypeptides of the invention also can be purified from natural or
recombinant sources using antibodies of the invention raised against the secreted
protein in methods which are well known in the art.

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the
cleavage point for that sequence, are available. For instance, the method of
McGeoch, Virus Res. 3:271-286 (1985), uses the information from a short N-terminal
charged region and a subsequent uncharged region of the complete (uncleaved)
protein. The method of von Heijne, Nucleic Acids Res. 14:4683-4690 (1986), uses the
information from the residues surrounding the cleavage site, typically residues -13 to
+2, where +1 indicates the amino terminus of the secreted protein. The accuracy of
predicting the cleavage points of known mammalian secretory proteins for each of
these methods is in the range of 75-80%. (von Heijne, supra.) However, the two
methods do not always produce the same predicted cleavage point(s) for a given
protein.

In the present case, the deduced amino acid sequence of the secreted
polypeptide was analyzed by a computer program called SignalP (Henrik Nielsen et
al., Protein Engineering 10:1-6 (1997)), which predicts the cellular location of a
protein based on the amino acid sequence. As part of this computational prediction of
localization, the methods of McGeoch and von Heijne are incorporated. The analysis
of the amino acid sequences of the secreted proteins described herein by this program
provided the results shown in Table 1.

As one of ordinary skill would appreciate, however, cleavage sites sometimes
vary from organism to organism and cannot be predicted with absolute certainty.
Accordingly, the present invention provides secreted polypeptides having a sequence
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e.,
+ or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that
in some cases, cleavage of the signal sequence from a secreted protein is not entirely
uniform, resulting in more than one secreted species. These polypeptides, and the
polynucleotides encoding such polypeptides, are contemplated by the present
invention.
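
The range of N-termini described above (within 5 residues of the predicted cleavage point) can be enumerated as follows. This is an illustrative sketch only: it assumes 1-based amino acid numbering as used in Table 1, takes the "Predicted First AA of Secreted Portion" as the reference position, and uses hypothetical function names.

```python
def candidate_n_termini(predicted_first_aa: int, orf_length: int, window: int = 5) -> list[int]:
    """List the 1-based start positions within +/- window residues of the prediction.

    Positions are clipped so that they stay inside the open reading frame.
    """
    lowest = max(1, predicted_first_aa - window)
    highest = min(orf_length, predicted_first_aa + window)
    return list(range(lowest, highest + 1))


def secreted_forms(protein: str, predicted_first_aa: int, window: int = 5) -> dict[int, str]:
    """Map each candidate N-terminal position to the corresponding secreted sequence."""
    starts = candidate_n_termini(predicted_first_aa, len(protein), window)
    return {start: protein[start - 1:] for start in starts}
```

For a predicted first residue at position 20 of a 100-residue protein, `candidate_n_termini(20, 100)` yields positions 15 through 25, that is, eleven possible secreted species.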

Moreover, the signal sequence identified by the above analysis may not
necessarily predict the naturally occurring signal sequence. For example, the
naturally occurring signal sequence may be further upstream from the predicted signal
sequence. However, it is likely that the predicted signal sequence will be capable of
directing the secreted protein to the ER. These polypeptides, and the polynucleotides
encoding such polypeptides, are contemplated by the present invention.

Polynucleotide and Polypeptide Variants

"Variant" refers to a polynucleotide or polypeptide differing from the
polynucleotide or polypeptide of the present invention, but retaining essential
properties thereof. Generally, variants are overall closely similar, and, in many
regions, identical to the polynucleotide or polypeptide of the present invention.

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence of the present invention, it is intended
that the nucleotide sequence of the polynucleotide is identical to the reference
sequence except that the polynucleotide sequence may include up to five point
mutations per each 100 nucleotides of the reference nucleotide sequence encoding the
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides
in the reference sequence may be deleted or substituted with another nucleotide, or a
number of nucleotides up to 5% of the total nucleotides in the reference sequence may
be inserted into the reference sequence. The query sequence may be an entire
sequence shown in Table 1, the ORF (open reading frame), or any fragment specified
as described herein.

As a practical matter, whether any particular nucleic acid molecule or
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide
sequence of the present invention can be determined conventionally using known
computer programs. A preferred method for determining the best overall match
between a query sequence (a sequence of the present invention) and a subject
sequence, also referred to as a global sequence alignment, uses
the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App.
Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences
are both DNA sequences. An RNA sequence can be compared by converting U's to
T's. The result of said global sequence alignment is in percent identity. Preferred
parameters used in a FASTDB alignment of DNA sequences to calculate percent
identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining Penalty=30,
Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size
Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever is
shorter.

If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the
results. This is because the FASTDB program does not account for 5' and 3'
truncations of the subject sequence when calculating percent identity. For subject
sequences truncated at the 5' or 3' ends, relative to the query sequence, the
percent identity is corrected by calculating the number of bases of the query sequence
that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent
of the total bases of the query sequence. Whether a nucleotide is matched/aligned is
determined by results of the FASTDB sequence alignment. This percentage is then
subtracted from the percent identity, calculated by the above FASTDB program using
the specified parameters, to arrive at a final percent identity score. This corrected
score is what is used for the purposes of the present invention. Only bases outside the
5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment,
which are not matched/aligned with the query sequence, are calculated for the
purposes of manually adjusting the percent identity score.

For example, a 90 base subject sequence is aligned to a 100 base query
sequence to determine percent identity. The deletions occur at the 5' end of the
subject sequence and therefore, the FASTDB alignment does not show a
match/alignment of the first 10 bases at the 5' end. The 10 unpaired bases represent
10% of the sequence (number of bases at the 5' and 3' ends not matched/total number
of bases in the query sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 bases were perfectly
matched the final percent identity would be 90%. In another example, a 90 base
subject sequence is compared with a 100 base query sequence. This time the
deletions are internal deletions so that there are no bases on the 5' or 3' of the subject
sequence which are not matched/aligned with the query. In this case the percent
identity calculated by FASTDB is not manually corrected. Once again, only bases 5'
and 3' of the subject sequence which are not matched/aligned with the query sequence
are manually corrected for. No other manual corrections are to be made for the purposes
of the present invention.
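
The manual correction described above reduces to a single subtraction. The sketch below is illustrative only: it does not run FASTDB, it takes the FASTDB-reported percent identity and the counts of unmatched terminal query bases as inputs supplied by the user, and the function name is hypothetical.

```python
def corrected_percent_identity(
    fastdb_percent: float,
    query_length: int,
    unmatched_5prime: int = 0,
    unmatched_3prime: int = 0,
) -> float:
    """Subtract the unmatched terminal fraction of the query from the FASTDB score.

    Only query bases lying outside the 5' and 3' ends of the subject sequence
    are counted; internal deletions contribute nothing to the correction.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    overhang = unmatched_5prime + unmatched_3prime
    return fastdb_percent - 100.0 * overhang / query_length


# First worked example from the text: 10 unmatched 5' bases of a 100-base query,
# with the remaining 90 bases perfectly matched.
truncated = corrected_percent_identity(100.0, 100, unmatched_5prime=10)
# Second worked example: internal deletions only, so no correction is applied.
internal = corrected_percent_identity(90.0, 100)
```

The value 90.0 passed in the second call is an arbitrary placeholder for whatever FASTDB would report; the point is only that the score is returned unchanged. The same arithmetic applies to polypeptide comparisons, with N- and C-terminal residues in place of 5' and 3' bases.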

By a polypeptide having an amino acid sequence at least, for example, 95%
"identical" to a query amino acid sequence of the present invention, it is intended that
the amino acid sequence of the subject polypeptide is identical to the query sequence
except that the subject polypeptide sequence may include up to five amino acid
alterations per each 100 amino acids of the query amino acid sequence. In other
words, to obtain a polypeptide having an amino acid sequence at least 95% identical
to a query amino acid sequence, up to 5% of the amino acid residues in the subject
sequence may be inserted or deleted (indels) or substituted with another amino acid.
These alterations of the reference sequence may occur at the amino or carboxy
terminal positions of the reference amino acid sequence or anywhere between those
terminal positions, interspersed either individually among residues in the reference
sequence or in one or more contiguous groups within the reference sequence.

As a practical matter, whether any particular polypeptide is at least 90%, 95%,
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in
Table 1 or to the amino acid sequence encoded by the deposited DNA clone can be
determined conventionally using known computer programs. A preferred method for
determining the best overall match between a query sequence (a sequence of the present
invention) and a subject sequence, also referred to as a global sequence alignment,
uses the FASTDB computer program based on the algorithm of
Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the
query and subject sequences are either both nucleotide sequences or both amino acid
sequences. The result of said global sequence alignment is in percent identity.
Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0,
k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group
Length=0, Cutoff Score=1, Window Size=sequence length, Gap Penalty=5, Gap Size
Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence,
whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or C-
terminal deletions, not because of internal deletions, a manual correction must be
made to the results. This is because the FASTDB program does not account for N-
and C-terminal truncations of the subject sequence when calculating global percent
identity. For subject sequences truncated at the N- and C-termini, relative to the
query sequence, the percent identity is corrected by calculating the number of residues
of the query sequence that are N- and C-terminal of the subject sequence, which are
not matched/aligned with a corresponding subject residue, as a percent of the total
residues of the query sequence. Whether a residue is matched/aligned is determined by
results of the FASTDB sequence alignment. This percentage is then subtracted from
the percent identity, calculated by the above FASTDB program using the specified
parameters, to arrive at a final percent identity score. This final percent identity score
is what is used for the purposes of the present invention. Only residues to the N- and
C-termini of the subject sequence, which are not matched/aligned with the query
sequence, are considered for the purposes of manually adjusting the percent identity
score. That is, only query residue positions outside the farthest N- and C-terminal
residues of the subject sequence are considered.

For example, a 90 amino acid residue subject sequence is aligned with a 100
residue query sequence to determine percent identity. The deletion occurs at the N-
terminus of the subject sequence and therefore, the FASTDB alignment does not
show a matching/alignment of the first 10 residues at the N-terminus. The 10
unpaired residues represent 10% of the sequence (number of residues at the N- and C-
termini not matched/total number of residues in the query sequence) so 10% is
subtracted from the percent identity score calculated by the FASTDB program. If the
remaining 90 residues were perfectly matched the final percent identity would be
90%. In another example, a 90 residue subject sequence is compared with a 100
residue query sequence. This time the deletions are internal deletions so there are no
residues at the N- or C-termini of the subject sequence which are not matched/aligned
with the query. In this case the percent identity calculated by FASTDB is not
manually corrected. Once again, only residue positions outside the N- and C-terminal
ends of the subject sequence, as displayed in the FASTDB alignment, which are not
matched/aligned with the query sequence are manually corrected for. No other manual
corrections are to be made for the purposes of the present invention.

The variants may contain alterations in the coding regions, non-coding
regions, or both. Especially preferred are polynucleotide variants containing
alterations which produce silent substitutions, additions, or deletions, but do not alter
the properties or activities of the encoded polypeptide. Nucleotide variants produced
by silent substitutions due to the degeneracy of the genetic code are preferred.
Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or
added in any combination are also preferred. Polynucleotide variants can be produced
for a variety of reasons, e.g., to optimize codon expression for a particular host
(change codons in the human mRNA to those preferred by a bacterial host such as E.
coli).
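
A silent substitution of the kind described above can be sketched as a codon-by-codon recoding that leaves the translation unchanged. The example assumes the standard genetic code; the short DNA string and the `toy_preference` mapping are arbitrary illustrations, not measured codon usage for E. coli or any other host, and all names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def recode(dna: str, preferred: dict[str, str]) -> str:
    """Replace each codon with the caller's preferred synonymous codon, if any."""
    usable = len(dna) - len(dna) % 3
    recoded = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        replacement = preferred.get(amino_acid, codon)
        if CODON_TABLE[replacement] != amino_acid:
            raise ValueError(f"{replacement} does not encode {amino_acid}")
        recoded.append(replacement)
    return "".join(recoded)


original = "ATGCTTAAAGGGTAA"               # encodes M-L-K-G-stop
toy_preference = {"L": "CTG", "G": "GGC"}  # arbitrary choices for illustration
silent_variant = recode(original, toy_preference)
```

Here `silent_variant` differs from `original` at two nucleotide positions, yet both translate to the same polypeptide, which is the defining property of a silent substitution.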

Naturally occurring variants are called "allelic variants," and refer to one of
several alternate forms of a gene occupying a given locus on a chromosome of an
organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These
allelic variants can vary at the polynucleotide and/or polypeptide level.
Alternatively, non-naturally occurring variants may be produced by mutagenesis
techniques or by direct synthesis.

Using known methods of protein engineering and recombinant DNA
technology, variants may be generated to improve or alter the characteristics of the
polypeptides of the present invention. For instance, one or more amino acids can be
deleted from the N-terminus or C-terminus of the secreted protein without substantial
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988
(1993), reported variant KGF proteins having heparin binding activity even after
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from
the carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7: 199-216
(1988).)

Moreover, ample evidence demonstrates that variants often retain a biological
activity similar to that of the naturally occurring protein. For example, Gayle and
coworkers (J. Biol. Chem. 268:22105-22111 (1993)) conducted extensive mutational
analysis of human cytokine IL-1a. They used random mutagenesis to generate over
3,500 individual IL-1a mutants that averaged 2.5 amino acid changes per variant over
the entire length of the molecule. Multiple mutations were examined at every
possible amino acid position. The investigators found that "[m]ost of the molecule
could be altered with little effect on either [binding or biological activity]." (See
Abstract.) In fact, only 23 unique amino acid sequences, out of more than 3,500
nucleotide sequences examined, produced a protein that significantly differed in
activity from wild-type.

Furthermore, even if deleting one or more amino acids from the N-terminus or
C-terminus of a polypeptide results in modification or loss of one or more biological
functions, other biological activities may still be retained. For example, the ability of
a deletion variant to induce and/or to bind antibodies which recognize the secreted
form will likely be retained when less than the majority of the residues of the secreted
form are removed from the N-terminus or C-terminus. Whether a particular
polypeptide lacking N- or C-terminal residues of a protein retains such immunogenic
activities can readily be determined by routine methods described herein and
otherwise known in the art.

Thus, the invention further includes polypeptide variants which show
substantial biological activity. Such variants include deletions, insertions,
inversions, repeats, and substitutions selected according to general rules known in the
art so as to have little effect on activity. For example, guidance concerning how to make
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.,
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main
strategies for studying the tolerance of an amino acid sequence to change.

The first strategy exploits the tolerance of amino acid substitutions by natural
selection during the process of evolution. By comparing amino acid sequences in
different species, conserved amino acids can be identified. These conserved amino
acids are likely important for protein function. In contrast, amino acid positions
where substitutions have been tolerated by natural selection are likely not
critical for protein function. Thus, positions tolerating amino acid
substitution could be modified while still maintaining biological activity of the
protein.

The second strategy uses genetic engineering to introduce amino acid changes
at specific positions of a cloned gene to identify regions critical for protein function.
For example, site directed mutagenesis or alanine-scanning mutagenesis (introduction
of single alanine mutations at every residue in the molecule) can be used.
(Cunningham and Wells, Science 244:1081-1085 (1989).) The resulting mutant
molecules can then be tested for biological activity.
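
The set of single-alanine mutants used in alanine-scanning mutagenesis can be enumerated in a few lines. This is a sketch of the sequence bookkeeping only, not of the laboratory procedure; it assumes one-letter amino acid codes and 1-based residue numbering, skips positions that are already alanine, and uses a hypothetical function name.

```python
def alanine_scan(protein: str) -> list[tuple[str, str]]:
    """Return (mutation name, mutant sequence) pairs, one per non-alanine residue.

    Mutation names follow the common form <original><position>A, e.g. K2A.
    """
    variants = []
    for index, residue in enumerate(protein):
        if residue == "A":
            continue  # substituting alanine with alanine changes nothing
        name = f"{residue}{index + 1}A"
        mutant = protein[:index] + "A" + protein[index + 1:]
        variants.append((name, mutant))
    return variants


# Toy three-residue input, chosen only to keep the output short.
scan = alanine_scan("MKA")
```

Each mutant in the resulting list would then be expressed and assayed individually, as the paragraph above describes.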

As the authors state, these two strategies have revealed that proteins are
surprisingly tolerant of amino acid substitutions. The authors further indicate which
amino acid changes are likely to be permissive at certain amino acid positions in the
protein. For example, most buried (within the tertiary structure of the protein) amino
acid residues require nonpolar side chains, whereas few features of surface side chains
are generally conserved. Moreover, tolerated conservative amino acid substitutions
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and
Ile; replacement of the hydroxyl residues Ser and Thr; replacement of the acidic
residues Asp and Glu; replacement of the amide residues Asn and Gln; replacement of
the basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr,
and Trp; and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly.
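
The conservative substitution groups listed above can be encoded as a lookup. The sketch below simply transcribes those groups into one-letter codes and treats a substitution as conservative when both residues fall in at least one common group; the group labels and function name are hypothetical, and the test is a reading aid rather than a predictor of activity.

```python
# Groups transcribed from the paragraph above, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "aliphatic_or_hydrophobic": set("AVLI"),  # Ala, Val, Leu, Ile
    "hydroxyl": set("ST"),                    # Ser, Thr
    "acidic": set("DE"),                      # Asp, Glu
    "amide": set("NQ"),                       # Asn, Gln
    "basic": set("KRH"),                      # Lys, Arg, His
    "aromatic": set("FYW"),                   # Phe, Tyr, Trp
    "small": set("ASTMG"),                    # Ala, Ser, Thr, Met, Gly
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one listed group."""
    if original == replacement:
        return False  # no substitution took place
    return any(
        original in members and replacement in members
        for members in CONSERVATIVE_GROUPS.values()
    )
```

Note that some residues belong to more than one group (for example Ala is both aliphatic and small), so a pair such as Ala to Gly qualifies through the small-sized group even though Gly is not listed as aliphatic.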

Besides conservative amino acid substitution, variants of the present invention
include (i) substitutions with one or more of the non-conserved amino acid residues,
where the substituted amino acid residues may or may not be one encoded by the
genetic code, or (ii) substitution with one or more of amino acid residues having a
substituent group, or (iii) fusion of the mature polypeptide with another compound,
such as a compound to increase the stability and/or solubility of the polypeptide (for
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a
sequence facilitating purification. Such variant polypeptides are deemed to be within
the scope of those skilled in the art from the teachings herein.

For example, polypeptide variants containing amino acid substitutions of
charged amino acids with other charged or neutral amino acids may produce proteins
with improved characteristics, such as less aggregation. Aggregation of
pharmaceutical formulations both reduces activity and increases clearance due to the
aggregate's immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340
(1967); Robbins et al., Diabetes 36:838-845 (1987); Cleland et al., Crit. Rev.
Therapeutic Drug Carrier Systems 10:307-377 (1993).)

A further embodiment of the invention relates to a polypeptide which
comprises the amino acid sequence of the present invention having an amino acid
sequence which contains at least one amino acid substitution, but not more than 50
amino acid substitutions, even more preferably, not more than 40 amino acid
substitutions, still more preferably, not more than 30 amino acid substitutions, and
still even more preferably, not more than 20 amino acid substitutions. Of course, in
order of ever-increasing preference, it is highly preferable for a polypeptide to have
an amino acid sequence which comprises the amino acid sequence of the present
invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1
amino acid substitutions. In specific embodiments, the number of additions,
substitutions, and/or deletions in the amino acid sequence of the present invention or
fragments thereof (e.g., the mature form and/or other fragments described herein), is
1-5, 5-10, 5-25, 5-50, 10-50 or 50-150; conservative amino acid substitutions are
preferable.

Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short
polynucleotide having a nucleic acid sequence contained in the deposited clone or
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30
nt, and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt
in length," for example, is intended to include 20 or more contiguous bases from the
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and
primers as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600,
2000 nucleotides) are preferred.

Moreover, representative examples of polynucleotide fragments of the
invention include, for example, fragments having a sequence from about nucleotide
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450,
451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 901-950,
951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251-1300,
1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601-1650,
1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951-2000,
or 2001 to the end of SEQ ID NO:X or the cDNA contained in the deposited
clone. In this context "about" includes the particularly recited ranges, larger or
smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini.
Preferably, these fragments encode a polypeptide which has biological activity. More
preferably, these polynucleotides can be used as probes or primers as discussed
herein.
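
As a non-limiting illustration, a series of consecutive fragment ranges of the kind recited above can be generated programmatically. The Python sketch below assumes a regular, non-overlapping series of 50-nucleotide windows followed by a final window running from position 2001 to the end of the sequence; it is not a reproduction of the recited list, and the function name `fragment_windows` and its parameters are hypothetical.

```python
def fragment_windows(seq_len, width=50, last_start=2001):
    """Return 1-based inclusive (start, end) ranges over a sequence.

    Fixed-width windows are produced up to position last_start - 1,
    followed by one final window from last_start to the end of the
    sequence (if the sequence is that long). All names are illustrative.
    """
    windows = []
    start = 1
    while start < last_start and start <= seq_len:
        end = min(start + width - 1, seq_len)  # clip the last window
        windows.append((start, end))
        start += width
    if seq_len >= last_start:
        windows.append((last_start, seq_len))
    return windows
```

For a hypothetical 120-nucleotide sequence this yields the ranges 1-50, 51-100, and 101-120. The "about" tolerance of up to five nucleotides at either terminus is not modeled here.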

In the present invention, a "polypeptide fragment" refers to a short amino acid
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the
deposited clone. Protein fragments may be "free-standing," or comprised within a
larger polypeptide of which the fragment forms a part or region, most preferably as a
single continuous region. Representative examples of polypeptide fragments of the
invention include, for example, fragments from about amino acid number 1-20, 21-40,
41-60, 61-80, 81-100, 102-120, 121-140, 141-160, or 161 to the end of the coding
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90,
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about"
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
amino acids, at either extreme or at both extremes.

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1-60,
can be deleted from the amino terminus of either the secreted polypeptide or the
mature form. Similarly, any number of amino acids, ranging from 1-30, can be
deleted from the carboxy terminus of the secreted protein or mature form.
Furthermore, any combination of the above amino and carboxy terminus deletions is
preferred. Similarly, polynucleotide fragments encoding these polypeptide fragments
are also preferred. 
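
The terminal deletions described above may be sketched as follows. This Python example is illustrative only: the function name `truncate`, its parameters, and the requirement that at least one residue remain are assumptions, while the limits of 60 amino-terminal and 30 carboxy-terminal residues are those recited in the text.

```python
def truncate(protein, n_del=0, c_del=0):
    """Delete n_del residues from the amino terminus and c_del residues
    from the carboxy terminus of a one-letter protein sequence."""
    if not 0 <= n_del <= 60:
        raise ValueError("amino-terminal deletion must be between 0 and 60")
    if not 0 <= c_del <= 30:
        raise ValueError("carboxy-terminal deletion must be between 0 and 30")
    if n_del + c_del >= len(protein):
        # Assumption: a fragment must retain at least one residue.
        raise ValueError("deletions would remove the entire sequence")
    return protein[n_del:len(protein) - c_del]
```

For instance, deleting two amino-terminal and three carboxy-terminal residues from the hypothetical sequence `MKTAYIAKQR` gives `TAYIA`.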

Also preferred are polypeptide and polynucleotide fragments characterized by
structural or functional domains, such as fragments that comprise alpha-helix and
alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and
turn-forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions,
surface-forming regions, substrate binding region, and high antigenic index regions.
Polypeptide fragments of SEQ ID NO:Y falling within conserved domains are
specifically contemplated by the present invention. Moreover, polynucleotide
fragments encoding these domains are also contemplated.

Other preferred fragments are biologically active fragments. Biologically
active fragments are those exhibiting activity similar, but not necessarily identical, to
an activity of the polypeptide of the present invention. The biological activity of the
fragments may include an improved desired activity, or a decreased undesirable
activity.

Epitopes & Antibodies 

In the present invention, "epitopes" refer to polypeptide fragments having
antigenic or immunogenic activity in an animal, especially in a human. A preferred
embodiment of the present invention relates to a polypeptide fragment comprising an
epitope, as well as the polynucleotide encoding this fragment. A region of a protein
molecule to which an antibody can bind is defined as an "antigenic epitope." In
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1983).)

Fragments which function as epitopes may be produced by any conventional
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985),
further described in U.S. Patent No. 4,631,211.)

In the present invention, antigenic epitopes preferably contain a sequence of at
least seven, more preferably at least nine, and most preferably between about 15 to
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson
et al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).)

Similarly, immunogenic epitopes can be used to induce antibodies according
to methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et
al., supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes
the secreted protein. The immunogenic epitopes may be presented together with a
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or,
if it is long enough (at least about 25 amino acids), without a carrier. However,
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to
be sufficient to raise antibodies capable of binding to, at the very least, linear epitopes
in a denatured polypeptide (e.g., in Western blotting).

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is
meant to include intact molecules as well as antibody fragments (such as, for
example, Fab and F(ab')2 fragments) which are capable of specifically binding to
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear
more rapidly from the circulation, and may have less non-specific tissue binding than
an intact antibody. (Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these
fragments are preferred, as well as the products of a Fab or other immunoglobulin
expression library. Moreover, antibodies of the present invention include chimeric,
single chain, and humanized antibodies.

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion
proteins. For example, the polypeptide of the present invention, when fused to a
second protein, can be used as an antigenic tag. Antibodies raised against the
polypeptide of the present invention can be used to indirectly detect the second
protein by binding to the polypeptide. Moreover, because secreted proteins target
cellular locations based on trafficking signals, the polypeptides of the present
invention can be used as targeting molecules once fused to other proteins.

Examples of domains that can be fused to polypeptides of the present
invention include not only heterologous signal sequences, but also other heterologous
functional regions. The fusion does not necessarily need to be direct, but may occur
through linker sequences.

Moreover, fusion proteins may also be engineered to improve characteristics
of the polypeptide of the present invention. For instance, a region of additional amino
acids, particularly charged amino acids, may be added to the N-terminus of the
polypeptide to improve stability and persistence during purification from the host cell
or subsequent handling and storage. Also, peptide moieties may be added to the
polypeptide to facilitate purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide moieties to facilitate handling
of polypeptides is a familiar and routine technique in the art.

Moreover, polypeptides of the present invention, including fragments, and
specifically epitopes, can be combined with parts of the constant domain of
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins
facilitate purification and show an increased half-life in vivo. One reported example
describes chimeric proteins consisting of the first two domains of the human
CD4-polypeptide and various domains of the constant regions of the heavy or light chains
of mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84-86
(1988).) Fusion proteins having disulfide-linked dimeric structures (due to the
IgG) can also be more efficient in binding and neutralizing other molecules than the
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J.
Biochem. 270:3958-3964 (1995).)

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules
together with another human protein or part thereof. In many cases, the Fc part in a
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively,
deleting the Fc part after the fusion protein has been expressed, detected, and purified,
would be desired. For example, the Fc portion may hinder therapy and diagnosis if
the fusion protein is used as an antigen for immunizations. In drug discovery, for
example, human proteins, such as hIL-5, have been fused with Fc portions for the
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See,
D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol.
Chem. 270:9459-9471 (1995).)

Moreover, the polypeptides of the present invention can be fused to marker
sequences, such as a peptide which facilitates purification of the fused polypeptide.
In preferred embodiments, the marker amino acid sequence is a hexa-histidine
peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue,
Chatsworth, CA, 91311), among others, many of which are commercially available.
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for
instance, hexa-histidine provides for convenient purification of the fusion protein.
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767
(1984).)

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

Vectors, Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of
the present invention, host cells, and the production of polypeptides by recombinant
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral
vector. Retroviral vectors may be replication competent or replication defective. In
the latter case, viral propagation generally will occur only in complementing host
cells.

The polynucleotides may be joined to a vector containing a selectable marker
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate,
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line
and then transduced into host cells.

The polynucleotide insert should be operatively linked to an appropriate
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac
promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to
name a few. Other suitable promoters will be known to the skilled artisan. The
expression constructs will further contain sites for transcription initiation, termination,
and, in the transcribed region, a ribosome binding site for translation. The coding
portion of the transcripts expressed by the constructs will preferably include a
translation initiating codon at the beginning and a termination codon (UAA, UGA or
UAG) appropriately positioned at the end of the polypeptide to be translated.
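
Merely as an illustration of the preceding requirement, the following Python sketch checks that a coding portion begins with an initiation codon and ends with one of the recited termination codons. The use of AUG as the initiation codon is an assumption (the text does not name it), as are the function name `has_start_and_stop` and the additional check for in-frame internal termination codons.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}  # termination codons recited in the text


def has_start_and_stop(transcript, start_codon="AUG"):
    """Return True if the coding portion starts with start_codon, ends with
    a termination codon, and has no earlier in-frame termination codon.

    DNA input is accepted: T is converted to U before checking.
    """
    rna = transcript.upper().replace("T", "U")
    if len(rna) < 6 or len(rna) % 3 != 0:
        return False  # too short, or not a whole number of codons
    codons = [rna[i:i + 3] for i in range(0, len(rna), 3)]
    if codons[0] != start_codon or codons[-1] not in STOP_CODONS:
        return False
    # Assumption: a premature in-frame stop makes the construct unsuitable.
    return not any(codon in STOP_CODONS for codon in codons[1:-1])
```

A short hypothetical coding portion such as `AUGGCCUAA` satisfies the check, whereas `AUGUAAGCCUAG` does not because of its premature termination codon.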

As indicated, the expression vectors will preferably include at least one
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin
resistance genes for culturing in E. coli and other bacteria. Representative examples
of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells;
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture
media and conditions for the above-described host cells are known in the art.

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9,
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A,
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech,
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.

Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, infection, or other methods. Such methods
are described in many standard laboratory manuals, such as Davis et al., Basic
Methods In Molecular Biology (1986). It is specifically contemplated that the
polypeptides of the present invention may in fact be expressed by a host cell lacking a
recombinant vector.

A polypeptide of this invention can be recovered and purified from
recombinant cell cultures by well-known methods including ammonium sulfate or
ethanol precipitation, acid extraction, anion or cation exchange chromatography,
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity
chromatography, hydroxylapatite chromatography and lectin chromatography. Most
preferably, high performance liquid chromatography ("HPLC") is employed for
purification.

Polypeptides of the present invention, and preferably the secreted form, can
also be recovered from: products purified from natural sources, including bodily
fluids, tissues and cells, whether directly isolated or cultured; products of chemical
synthetic procedures; and products produced by recombinant techniques from a
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant,
insect, and mammalian cells. Depending upon the host employed in a recombinant
production procedure, the polypeptides of the present invention may be glycosylated
or may be non-glycosylated. In addition, polypeptides of the invention may also
include an initial modified methionine residue, in some cases as a result of
host-mediated processes. Thus, it is well known in the art that the N-terminal methionine
encoded by the translation initiation codon generally is removed with high efficiency
from any protein after translation in all eukaryotic cells. While the N-terminal
methionine on most proteins also is efficiently removed in most prokaryotes, for some
proteins, this prokaryotic removal process is inefficient, depending on the nature of
the amino acid to which the N-terminal methionine is covalently linked.

In addition to encompassing host cells containing the vector constructs
discussed herein, the invention also encompasses primary, secondary, and
immortalized host cells of vertebrate origin, particularly mammalian origin, that have
been engineered to delete or replace endogenous genetic material (e.g., coding
sequence), and/or to include genetic material (e.g., heterologous polynucleotide
sequences) that is operably associated with the polynucleotides of the invention, and
which activates, alters, and/or amplifies endogenous polynucleotides. For example,
techniques known in the art may be used to operably associate heterologous control
regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences
via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24,
1997; International Publication No. WO 96/29411, published September 26, 1996;
International Publication No. WO 94/12650, published August 4, 1994; Koller et al.,
Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435-438
(1989), the disclosures of each of which are incorporated by reference in their
entireties).

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome
identification. There exists an ongoing need to identify new chromosome markers,
since few chromosome marking reagents, based on actual sequence data (repeat
polymorphisms), are presently available. Each polynucleotide of the present
invention can be used as a chromosome marker.

Briefly, sequences can be mapped to chromosomes by preparing PCR primers
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be
selected using computer analysis so that primers do not span more than one predicted
exon in the genomic DNA. These primers are then used for PCR screening of
somatic cell hybrids containing individual human chromosomes. Only those hybrids
containing the human gene corresponding to the SEQ ID NO:X will yield an
amplified fragment.
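
The assignment logic implied by the screening described above can be sketched, by way of example only, as a concordance test: a chromosome is a candidate when it is present in every hybrid that yields an amplified fragment and absent from every hybrid that does not. The Python function `assign_chromosome`, the panel layout, and the hybrid names below are hypothetical and are not taken from the disclosure.

```python
from typing import Dict, List, Set


def assign_chromosome(panel: Dict[str, Set[str]],
                      pcr_positive: Dict[str, bool]) -> List[str]:
    """Return chromosomes whose presence across the hybrid panel matches
    the PCR result for every hybrid (perfect concordance).

    panel maps a hybrid name to the human chromosomes it retains;
    pcr_positive maps the same hybrid names to the PCR outcome.
    """
    all_chromosomes: Set[str] = set()
    for retained in panel.values():
        all_chromosomes |= retained
    return sorted(
        chrom for chrom in all_chromosomes
        if all((chrom in panel[hybrid]) == pcr_positive[hybrid]
               for hybrid in panel)
    )
```

With a hypothetical panel in which hybrid `h1` retains chromosomes 1 and 7, `h2` retains 7 and X, and `h3` retains 1 and X, amplification in `h1` and `h2` but not `h3` points to chromosome 7 under this concordance rule. Real panels may require tolerance for discordant hybrids, which this sketch does not model.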

Similarly, somatic hybrids provide a rapid method of PCR mapping the
polynucleotides to particular chromosomes. Three or more clones can be assigned per
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides
can be achieved with panels of specific chromosome fragments. Other gene mapping
strategies that can be used include in situ hybridization, prescreening with labeled
flow-sorted chromosomes, and preselection by hybridization to construct
chromosome-specific cDNA libraries.

Precise chromosomal location of the polynucleotides can also be achieved
using fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread.
This technique uses polynucleotides as short as 500 or 600 bases; however,
polynucleotides 2,000-4,000 bp are preferred. For a review of this technique, see
Verma et al., "Human Chromosomes: a Manual of Basic Techniques," Pergamon
Press, New York (1988).

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, 
the physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation 
of a particular disease. (Disease mapping data are found, for example, in V.
McKusick, Mendelian Inheritance in Man (available online through Johns Hopkins
University Welch Medical Library).) Assuming 1 megabase mapping resolution and
one gene per 20 kb, a cDNA precisely localized to a chromosomal region associated 
with the disease could be one of 50-500 potential causative genes. 
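
The lower figure in the preceding estimate follows directly from the stated assumptions, as the short Python sketch below shows. The function name `candidate_genes` is hypothetical, and the reading that the upper figure of 500 corresponds to a region of roughly 10 megabases is an inference from the arithmetic rather than a statement in the text.

```python
def candidate_genes(region_bp, bp_per_gene=20_000):
    """Estimate the number of genes in a mapped region, assuming a
    uniform density of one gene per bp_per_gene base pairs."""
    return region_bp // bp_per_gene


# 1 megabase resolution at one gene per 20 kb gives 50 candidate genes.
genes_at_1_mb = candidate_genes(1_000_000)
# A coarser, 10 megabase region would give 500 under the same density.
genes_at_10_mb = candidate_genes(10_000_000)
```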

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural
alterations exist, the presence of point mutations is ascertained. Mutations observed
in some or all affected individuals, but not in normal individuals, indicate that the
mutation may cause the disease. However, complete sequencing of the polypeptide
and the corresponding gene from several normal individuals is required to distinguish 
the mutation from a polymorphism. If a new polymorphism is identified, this 
polymorphic polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected
individuals as compared to unaffected individuals can be assessed using
polynucleotides of the present invention. Any of these alterations (altered expression,
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic
marker.

In addition to the foregoing, a polynucleotide can be used to control gene
expression through triple helix formation or antisense DNA or RNA. Both methods
rely on binding of the polynucleotide to DNA or RNA. For these techniques,
preferred polynucleotides are usually 20 to 40 bases in length and complementary to
either the region of the gene involved in transcription (triple helix - see Lee et al.,
Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan
et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J.
Neurochem. 56:560 (1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene
Expression, CRC Press, Boca Raton, FL (1988).) Triple helix formation optimally
results in a shut-off of RNA transcription from DNA, while antisense RNA
hybridization blocks translation of an mRNA molecule into polypeptide. Both
techniques are effective in model systems, and the information disclosed herein can
be used to design antisense or triple helix polynucleotides in an effort to treat disease.
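
As one hedged illustration of the antisense case, an oligonucleotide complementary to a chosen stretch of an mRNA can be derived by taking the reverse complement of that stretch. The Python function `antisense_oligo` below is a hypothetical sketch; only the 20 to 40 base length range is taken from the text, and practical design considerations such as target accessibility and specificity are not modeled.

```python
_COMPLEMENT = str.maketrans("ACGU", "UGCA")  # RNA base pairing


def antisense_oligo(mrna, start, length=20):
    """Return the antisense RNA (written 5'->3') complementary to
    mrna[start:start + length]; start is a 0-based index."""
    if not 20 <= length <= 40:
        raise ValueError("length must be between 20 and 40 bases")
    # Accept DNA-style input by converting T to U.
    target = mrna.upper().replace("T", "U")[start:start + length]
    if len(target) != length:
        raise ValueError("target region runs past the end of the mRNA")
    # Complement each base, then reverse to read 5'->3'.
    return target.translate(_COMPLEMENT)[::-1]
```

For the hypothetical mRNA `AUGGCCAAGUUCGACCUGAAGG`, the 20-base antisense sequence against positions 0 to 19 is `UUCAGGUCGAACUUGGCCAU`.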

Polynucleotides of the present invention are also useful in gene therapy. One
goal of gene therapy is to insert a normal gene into an organism having a defective
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the
present invention offer a means of targeting such genetic defects in a highly accurate
manner. Another goal is to insert a new gene that was not present in the host genome,
thereby producing a new trait in the host cell.

The polynucleotides are also useful for identifying individuals from minute
biological samples. The United States military, for example, is considering the use of
restriction fragment length polymorphism (RFLP) for identification of its personnel.
In this technique, an individual's genomic DNA is digested with one or more
restriction enzymes, and probed on a Southern blot to yield unique bands for
identifying personnel. This method does not suffer from the current limitations of
"Dog Tags" which can be lost, switched, or stolen, making positive identification
difficult. The polynucleotides of the present invention can be used as additional DNA
markers for RFLP.

The polynucleotides of the present invention can also be used as an alternative
to RFLP, by determining the actual base-by-base DNA sequence of selected portions
of an individual's genome. These sequences can be used to prepare PCR primers for
amplifying and isolating such selected DNA, which can then be sequenced. Using
this technique, individuals can be identified because each individual will have a
unique set of DNA sequences. Once a unique ID database is established for an
individual, positive identification of that individual, living or dead, can be made from
extremely small tissue samples.

Forensic biology also benefits from using DNA-based identification 
techniques as disclosed herein. DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, 
etc., can be amplified using PCR. In one prior art technique, gene sequences 
amplified from polymorphic loci, such as DQa class II HLA gene, are used in forensic 
biology to identify individuals. (Erlich, H., PCR Technology, Freeman and Co. 
(1992).) Once these specific polymorphic loci are amplified, they are digested with 
one or more restriction enzymes, yielding an identifying set of bands on a Southern 
blot probed with DNA corresponding to the DQa class II HLA gene. Similarly, 
polynucleotides of the present invention can be used as polymorphic markers for 
forensic purposes. 

There is also a need for reagents capable of identifying the source of a 
particular tissue. Such need arises, for example, in forensics when presented with 
tissue of unknown origin. Appropriate reagents can comprise, for example, DNA 
probes or primers specific to particular tissue prepared from the sequences of the 
present invention. Panels of such reagents can identify tissue by species and/or by 
organ type. In a similar fashion, these reagents can be used to screen tissue cultures 
for contamination. 

At the very least, the polynucleotides of the present invention can be used as
molecular weight markers on Southern gels, as diagnostic probes for the presence of a
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences
in the process of discovering novel polynucleotides, for selecting and making
oligomers for attachment to a "gene chip" or other support, to raise anti-DNA
antibodies using DNA immunization techniques, and as an antigen to elicit an
immune response.

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The
following description should be considered exemplary and utilizes known techniques.

A polypeptide of the present invention can be used to assay protein levels in a
biological sample using antibody-based techniques. For example, protein expression
in tissues can be studied with classical immunohistological methods. (Jalkanen, M.,
et al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3087-3096
(1987).) Other antibody-based methods useful for detecting protein gene
expression include immunoassays, such as the enzyme linked immunosorbent assay
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known
in the art and include enzyme labels, such as glucose oxidase, and radioisotopes, such
as iodine (125I, 121I), carbon (14C), sulfur (35S), tritium (3H), indium (112In), and
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and
biotin.

In addition to assaying secreted protein levels in a biological sample, proteins
can also be detected in vivo by imaging. Antibody labels or markers for in vivo
imaging of protein include those detectable by X-radiography, NMR or ESR. For
X-radiography, suitable labels include radioisotopes such as barium or cesium, which
emit detectable radiation but are not overtly harmful to the subject. Suitable markers
for NMR and ESR include those with a detectable characteristic spin, such as
deuterium, which may be incorporated into the antibody by labeling of nutrients for
the relevant hybridoma.

A protein-specific antibody or antibody fragment which has been labeled with
an appropriate detectable imaging moiety, such as a radioisotope (for example, 131I,
112In, 99mTc), a radio-opaque substance, or a material detectable by nuclear
magnetic resonance, is introduced (for example, parenterally, subcutaneously, or
intraperitoneally) into the mammal. It will be understood in the art that the size of the
subject and the imaging system used will determine the quantity of imaging moiety
needed to produce diagnostic images. In the case of a radioisotope moiety, for a
human subject, the quantity of radioactivity injected will normally range from about 5
to 20 millicuries of 99mTc. The labeled antibody or antibody fragment will then
preferentially accumulate at the location of cells which contain the specific protein.
In vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics
of Radiolabeled Antibodies and Their Fragments" (Chapter 13 in Tumor Imaging:
The Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds.,
Masson Publishing Inc. (1982)).

Thus, the invention provides a method of diagnosing a disorder, which 
involves (a) assaying the expression of a polypeptide of the present invention in cells 
or body fluid of an individual; and (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 
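
The comparison against a standard expression level can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function name `compare_to_standard`, the arbitrary expression units, and the 20% tolerance band are hypothetical, since the text does not specify how large a change must be before it is considered indicative of a disorder.

```python
def compare_to_standard(assayed_level, standard_level, tolerance=0.20):
    """Classify an assayed expression level relative to a standard level.

    `tolerance` is a hypothetical fractional band around the standard
    within which the two levels are treated as equivalent.
    """
    if standard_level <= 0:
        raise ValueError("standard_level must be positive")
    ratio = assayed_level / standard_level
    if ratio > 1.0 + tolerance:
        return "increased"
    if ratio < 1.0 - tolerance:
        return "decreased"
    return "unchanged"


if __name__ == "__main__":
    # Illustrative values only; units are arbitrary.
    for level in (150.0, 100.0, 40.0):
        print(level, compare_to_standard(level, standard_level=100.0))
```

In practice the standard level and any decision band would have to be established for the particular polypeptide, tissue, and assay; the sketch only shows the shape of the comparison.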

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can 
also be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, 
such as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, 
the polypeptides of the present invention can be used to test the following biological 
activities. 


Biological Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, 
red blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) 
cells from pluripotent stem cells. The etiology of these immune deficiencies or 
disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, a polynucleotide 
or polypeptide of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A 
polypeptide or polynucleotide of the present invention could be used to increase 
differentiation and proliferation of hematopoietic cells, including the pluripotent stem 
cells, in an effort to treat those disorders associated with a decrease in certain (or 
many) types of hematopoietic cells. Examples of immunologic deficiency syndromes 
include, but are not limited to: blood protein disorders (e.g., agammaglobulinemia, 
dysgammaglobulinemia), ataxia telangiectasia, common variable immunodeficiency, 
DiGeorge Syndrome, HIV infection, HTLV-BLV infection, leukocyte adhesion 
deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g., thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that 
can decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of 
the host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the 
present invention include, but are not limited to: Addison's Disease, hemolytic 
anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, Polyendocrinopathies, Purpura, Reiter's Disease, Stiff Man Syndrome, 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary 
Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes mellitus, and 
autoimmune inflammatory eye disease. 



Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by a polypeptide 
or polynucleotide of the present invention. Moreover, these molecules can be used to 
treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood group 
incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of 
T-cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also 
be used to modulate inflammation. For example, the polypeptide or polynucleotide 
may inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), 
ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1). 

Hyperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect 
hyperproliferative disorders, including neoplasms. A polypeptide or polynucleotide 
of the present invention may inhibit the proliferation of the disorder through direct or 
indirect interactions. Alternatively, a polypeptide or polynucleotide of the present 
invention may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hyperproliferative disorders can be treated. 
This immune response may be increased by either enhancing an existing immune 
response, or by initiating a new immune response. Alternatively, decreasing an 
immune response, such as with a chemotherapeutic agent, may also be a method of 
treating hyperproliferative disorders. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
a polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: 
hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
sarcoidosis, Sezary Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's 
Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia, 
located in an organ system listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, 
infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response. 
Alternatively, the polypeptide or polynucleotide of the present invention may also 
directly inhibit the infectious agent, without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses include, but are not limited to, the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiolitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox, hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and 
that can be treated or detected by a polynucleotide or polypeptide of the present 
invention include, but are not limited to, the following Gram-negative and Gram-positive 
bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Nocardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), 
Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatomycoses, 
Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, 
Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae 
(e.g., Acinetobacter, Gonorrhea, Meningococcal), Pasteurellaceae Infections (e.g., 
Actinobacillus, Haemophilus, Pasteurella), Pseudomonas, Rickettsiaceae, 
Chlamydiaceae, Syphilis, and Staphylococcal. These bacterial or fungal families can 
cause the following diseases or symptoms, including, but not limited to: bacteremia, 
endocarditis, eye infections (conjunctivitis, tuberculosis, uveitis), gingivitis, 
opportunistic infections (e.g., AIDS related infections), paronychia, prosthesis-related 
infections, Reiter's Disease, respiratory tract infections, such as Whooping Cough or 
Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, 
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin 
diseases (e.g., cellulitis, dermatomycoses), toxemia, urinary tract infections, and wound 
infections. A polypeptide or polynucleotide of the present invention can be used to 
treat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but are not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, 
Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and 
Trichomonas. These parasites can cause a variety of diseases or symptoms, including, 
but not limited to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., 
dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS 
related), Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or 
polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a 
polynucleotide of the present invention, and returning the engineered cells to the 
patient (ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present 
invention can be used as an antigen in a vaccine to raise an immune response against 
infectious disease. 



Regeneration 

A polynucleotide or polypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, burns, 
incisions, or ulcers), age, disease (e.g., osteoporosis, osteoarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, 
and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration 
occurs without scarring or with decreased scarring. Regeneration also may include 
angiogenesis. 

Moreover, a polynucleotide or polypeptide of the present invention may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide of the present invention could also be used 
prophylactically in an effort to avoid damage. Specific diseases that could be treated 
include tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. A 
further example of tissue regeneration of non-healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and 
Shy-Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have 
chemotaxis activity. A chemotactic molecule attracts or mobilizes cells (e.g., 
monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, or 
site of hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotactic activity of particular cells. These chemotactic molecules can then be used 
to treat inflammation, infection, hyperproliferative disorders, or any immune system 
disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotactic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of 
the present invention can also attract fibroblasts, which can be used to treat wounds. 

It is also contemplated that a polynucleotide or polypeptide of the present 
invention may inhibit chemotactic activity. These molecules could also be used to 
treat disorders. Thus, a polynucleotide or polypeptide of the present invention could 
be used as an inhibitor of chemotaxis. 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules 
that bind to the polypeptide or for molecules to which the polypeptide binds. The 
binding of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or 
small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et al., Current Protocols in Immunology 
1(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially 
containing the molecule to observe binding, stimulation, or inhibition of activity of 
either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the 
polypeptide, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to the polypeptide. 
Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a 
standard. 
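
The competition format mentioned above can be made concrete with a short Python sketch. It assumes, hypothetically, that the readout is the signal from a labeled competitor bound to the polypeptide, measured with and without the candidate compound, with a background reading subtracted from both. The function names, the background handling, and the 50% cutoff are illustrative choices and do not come from the text.

```python
def percent_displacement(signal_with_candidate, signal_without_candidate,
                         background=0.0):
    """Percent of labeled-competitor signal lost when the candidate is present."""
    reference = signal_without_candidate - background
    if reference <= 0:
        raise ValueError("reference signal must exceed background")
    # Clamp at zero so a reading below background counts as full displacement.
    remaining = max(signal_with_candidate - background, 0.0)
    return 100.0 * (1.0 - remaining / reference)


def binds(signal_with_candidate, signal_without_candidate,
          background=0.0, cutoff=50.0):
    """Call a candidate a binder if displacement reaches `cutoff` percent.

    The cutoff is an assumed, assay-specific threshold.
    """
    displacement = percent_displacement(
        signal_with_candidate, signal_without_candidate, background)
    return displacement >= cutoff
```

A candidate that leaves the labeled-competitor signal essentially unchanged gives a displacement near zero and would not be called a binder under this assumed cutoff.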

Preferably, an ELISA assay can measure polypeptide level or activity in a 
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to bring 
about a particular result in a patient (e.g., blood vessel growth) by activating or 
inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which 
may inhibit or enhance the production of the polypeptide from suitably manipulated 
cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
candidate binding compound with a polypeptide of the invention; and (b) determining 
if binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound 
with a polypeptide of the invention, (b) assaying a biological activity, and (c) 
determining if a biological activity of the polypeptide has been altered. 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 
used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
circadian rhythms, depression (including depressive disorders), tendency for violence, 
tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-like 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other 
cognitive qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 

Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% 
identical to a sequence of at least about 50 contiguous nucleotides in the nucleotide 
sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1. 
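
The "at least 95% identical over at least about 50 contiguous nucleotides" criterion can be illustrated with a simplified Python sketch. This is an assumption-laden illustration, not the definition of identity used in this specification: it compares windows position by position without gaps, ignores the reverse complement, and uses hypothetical function names. Where the specification defines percent identity through a particular alignment method, that definition governs.

```python
def window_identity(a, b):
    """Fraction of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def has_identical_stretch(query, reference, window=50, min_identity=0.95):
    """Return True if some `window`-long stretch of `reference` is matched,
    without gaps, by some stretch of `query` at >= `min_identity`."""
    query, reference = query.upper(), reference.upper()
    if len(query) < window or len(reference) < window:
        return False
    for i in range(len(reference) - window + 1):
        ref_win = reference[i:i + window]
        for j in range(len(query) - window + 1):
            if window_identity(query[j:j + window], ref_win) >= min_identity:
                return True
    return False
```

With a 50-nucleotide window, 95% identity tolerates at most two mismatches (48 of 50 matching positions is 96%; 47 of 50 is 94%). The nested loop is quadratic in sequence length and is meant only to make the criterion explicit; a real comparison would use an alignment tool.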

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Start Codon and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5' 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the 
nucleotide at about the position of the 3' Nucleotide of the Clone Sequence as defined 
for SEQ ID NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of 
the First Amino Acid of the Signal Peptide and ending with the nucleotide at about 
the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X 
in Table 1. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence of SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic 
acid molecule which hybridizes does not hybridize under stringent hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence consisting of only 
A residues or of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of 
at least 50 contiguous nucleotides is included in the nucleotide sequence of the 
complete open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to a sequence of at 
least 500 contiguous nucleotides in the nucleotide sequence encoded by said human 
cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological 
sample a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
wherein X is any integer as defined in Table 1; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1; which method comprises a step of comparing a nucleotide sequence of at least one 
nucleic acid molecule in said sample with a sequence selected from said group and 
determining whether the sequence of said nucleic acid molecule in said sample is at 
least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence 

selected from said group. Similarly, also preferred is the above method wherein said 
step of comparing sequences is performed by comparing the nucleotide sequence 
determined from a nucleic acid molecule in said sample with said sequence selected 
from said group. The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

A further preferred embodiment is a method for identifying the species, tissue 
or cell type of a biological sample which method comprises a step of detecting nucleic 
acid molecules in said sample, if any, comprising a nucleotide sequence that is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
wherein X is any integer as defined in Table 1; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

The method for identifying the species, tissue or cell type of a biological 
sample can comprise a step of detecting nucleic acid molecules comprising a 
nucleotide sequence in a panel of at least two nucleotide sequences, wherein at least 
one sequence in said panel is at least 95% identical to a sequence of at least 50 
contiguous nucleotides in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table 1, which method comprises a step of detecting in a 
biological sample obtained from said subject nucleic acid molecules, if any, 
comprising a nucleotide sequence that is at least 95% identical to a sequence of at 
least 50 contiguous nucleotides in a sequence selected from the group consisting of: a 
nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1; 
and a nucleotide sequence encoded by a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit 
Number shown for said cDNA clone in Table 1. 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at 
least two nucleotide sequences, wherein at least one sequence in said panel is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise 
a panel of at least two nucleotide sequences, wherein at least one sequence in said 
panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a 
sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table 1; and a nucleotide sequence 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of 
positions beginning with the residue at about the position of the First Amino Acid of 
the Secreted Portion and ending with the residue at about the Last Amino Acid of the 
Open Reading Frame as set forth for SEQ ID NO:Y in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to a sequence of at least about 100 contiguous amino 
acids in the amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to the complete amino acid sequence of SEQ ID 
NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 90% identical to a sequence of at least about 10 contiguous amino 
acids in the complete amino acid sequence of a secreted protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in 
the amino acid sequence of the secreted portion of the protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. 

Further preferred is an isolated antibody which binds specifically to a 
polypeptide comprising an amino acid sequence that is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table 1; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

Further preferred is a method for detecting in a biological sample a 
polypeptide comprising an amino acid sequence which is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table 1; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1; which method comprises a step of comparing an amino acid sequence of at least 
one polypeptide molecule in said sample with a sequence selected from said group 
and determining whether the sequence of said polypeptide molecule in said sample is 
at least 90% identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of 
polypeptides in said sample to an antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: 
an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in 
Table 1; and a complete amino acid sequence of a protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is the above method wherein said step of comparing sequences 
is performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting polypeptide molecules 
in said sample, if any, comprising an amino acid sequence that is at least 90% 
identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is 
any integer as defined in Table 1; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

Also preferred is the above method for identifying the species, tissue or cell 
type of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table 1, which method comprises a step of detecting in a 
biological sample obtained from said subject polypeptide molecules comprising an 
amino acid sequence in a panel of at least two amino acid sequences, wherein at least 
one sequence in said panel is at least 90% identical to a sequence of at least 10 
contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1. 

In any of these methods, the step of detecting said polypeptide molecules 
includes using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at 
least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO: Y 
wherein Y is any integer as defined in Table 1; and a complete amino acid sequence 
of a secreted protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said 
polypeptide in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1. 

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecules into a vector. Also preferred 
is the recombinant vector produced by this method. Also preferred is a method of 
making a recombinant host cell comprising introducing the vector into a host cell, as 
well as the recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such that said polypeptide is 
expressed and recovering said polypeptide. Also preferred is this method of making 
an isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and 
said polypeptide is a secreted portion of a human secreted protein comprising an 
amino acid sequence selected from the group consisting of: an amino acid sequence of 
SEQ ID NO:Y beginning with the residue at the position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO:Y wherein Y is an integer set forth in Table 1 and 
said position of the First Amino Acid of the Secreted Portion of SEQ ID NO:Y is 
defined in Table 1; and an amino acid sequence of a secreted portion of a protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. The isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an 
increased level of a secreted protein activity, which method comprises administering 
to such an individual a pharmaceutical composition comprising an amount of an 
isolated polypeptide, polynucleotide, or antibody of the claimed invention effective to 
increase the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sample 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. 
Table 1 identifies the vectors used to construct the cDNA library from which each 
clone was isolated. In many cases, the vector used to construct the library is a phage 
vector from which a plasmid has been excised. The table immediately below 
correlates the related plasmid for each phage vector used in constructing the cDNA 
library. For example, where a particular clone is identified in Table 1 as being 
isolated in the vector "Lambda Zap," the corresponding deposited clone is in 
"pBluescript." 

Vector Used to Construct Library     Corresponding Deposited Plasmid 

Lambda Zap                           pBluescript (pBS) 
Uni-Zap XR                           pBluescript (pBS) 
Zap Express                          pBK 
lafmid BA                            plafmid BA 
pSport1                              pSport1 
pCMVSport 2.0                        pCMVSport 2.0 
pCMVSport 3.0                        pCMVSport 3.0 
pCR®2.1                              pCR®2.1 
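
The correspondence in the table above can also be written as a simple lookup. The following Python sketch is illustrative only; the names `DEPOSITED_PLASMID` and `deposited_plasmid` are hypothetical, and the spelling "pSport1" is an assumed reading of the vector name.

```python
# Library vector (as listed in Table 1) -> plasmid in which the clone is deposited.
DEPOSITED_PLASMID = {
    "Lambda Zap": "pBluescript (pBS)",
    "Uni-Zap XR": "pBluescript (pBS)",
    "Zap Express": "pBK",
    "lafmid BA": "plafmid BA",
    "pSport1": "pSport1",
    "pCMVSport 2.0": "pCMVSport 2.0",
    "pCMVSport 3.0": "pCMVSport 3.0",
    "pCR®2.1": "pCR®2.1",
}


def deposited_plasmid(library_vector: str) -> str:
    """Return the deposited plasmid for a library vector named in the table."""
    try:
        return DEPOSITED_PLASMID[library_vector]
    except KeyError:
        raise KeyError(
            f"no deposited plasmid listed for {library_vector!r}"
        ) from None
```

For example, `deposited_plasmid("Lambda Zap")` returns `"pBluescript (pBS)"`, matching the example given in the text.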



Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5,128,256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 11011 N. Torrey 
Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Both can be transformed into E. coli strain 
XL-1 Blue, also available from Stratagene. pBS comes in 4 forms: SK+, SK-, KS+ 
and KS-. The S and K refer to the orientation of the polylinker to the T7 and T3 
primer sequences which flank the polylinker region ("S" is for SacI and "K" is for 
KpnI, which are the first sites on each respective end of the linker). "+" or "-" refers to 
the orientation of the f1 origin of replication ("ori"), such that in one orientation, 
single stranded rescue initiated from the f1 ori generates sense strand DNA and in the 
other, antisense. 

Vectors pSport1, pCMVSport 2.0 and pCMVSport 3.0 were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
contain an ampicillin resistance gene and may be transformed into E. coli strain 
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, 
NY) contains an ampicillin resistance gene and can be transformed into E. coli strain 
XL-1 Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday 
Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DH10B, available from Life Technologies. (See, for 
instance, Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., 
Bio/Technology 9: (1991).) Preferably, a polynucleotide of the present invention 
does not comprise the phage vector sequences identified for the particular clone in 
Table 1, as well as the corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number 
cited in Table 1 for any given cDNA clone also may contain one or more additional 
plasmids, each comprising a cDNA clone different from that given clone. Thus, 
deposits sharing the same ATCC Deposit Number contain at least a plasmid for each 
cDNA clone identified in Table 1. Typically, each ATCC deposit sample cited in 
Table 1 comprises a mixture of approximately equal amounts (by weight) of about 50 
plasmid DNAs, each containing a different cDNA clone; but such a deposit sample 
may include plasmids for more or fewer than 50 cDNA clones, up to about 500 cDNA 
clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ 
ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with ³²P-γ-ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1982).) The plasmid mixture is transformed into a suitable host, as indicated above 
(such as XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, 
such as those provided by the vector supplier or in related publications or patents 
cited above. The transformants are plated on 1.5% agar plates (containing the 
appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants 
(colonies) per plate. These plates are screened using Nylon membranes according to 
routine methods for bacterial colony screening (e.g., Sambrook et al., Molecular 
Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory 
Press, pages 1.93 to 1.104), or other techniques known to those of skill in the art. 
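
As a rough guide to how many colonies need to be screened, the following Python sketch estimates how often a given clone should appear among the plated transformants. It is a back-of-the-envelope illustration only, and it assumes that every plasmid in the deposit is equally represented and transforms with equal efficiency, which may not hold in practice. The function names are hypothetical.

```python
def expected_positives(colonies: int, clones_in_deposit: int) -> float:
    """Expected number of colonies carrying the clone of interest."""
    return colonies / clones_in_deposit


def prob_at_least_one(colonies: int, clones_in_deposit: int) -> float:
    """Probability that at least one screened colony carries the clone,
    assuming each colony independently carries it with chance 1/N."""
    p = 1.0 / clones_in_deposit
    return 1.0 - (1.0 - p) ** colonies


# One plate of about 150 colonies from a typical deposit of about 50 plasmids.
typical = prob_at_least_one(150, 50)
# The same plate from a deposit at the upper bound of about 500 plasmids.
upper_bound = prob_at_least_one(150, 500)
```

Under these assumptions a single plate of about 150 colonies from a 50-plasmid deposit is expected to contain about three positives, whereas a deposit of about 500 plasmids would generally call for several plates.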

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried out under routine conditions, for instance, in 25 µl of reaction 
mixture with 0.5 µg of the above cDNA template. A convenient reaction mixture is 
1.5-5 mM MgCl₂, 0.01% (w/v) gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 
pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 
min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The 
amplified product is analyzed by agarose gel electrophoresis and the DNA band with 
expected molecular weight is excised and purified. The PCR product is verified to be 
the selected sequence by subcloning and sequencing the DNA product. 
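
The reaction set-up above can be checked with a few lines of arithmetic. This Python sketch is illustrative only. It assumes a 25 µl reaction volume and ignores ramp times between temperature steps; the helper names are hypothetical.

```python
def micromolar(pmol: float, volume_ul: float) -> float:
    """Concentration in µM; 1 pmol per µl equals 1 µmol per litre."""
    return pmol / volume_ul


def ng_per_ul(micrograms: float, volume_ul: float) -> float:
    """Mass concentration in ng/µl."""
    return micrograms * 1000.0 / volume_ul


def cycling_minutes(cycles: int, step_minutes: tuple) -> float:
    """Total time spent at temperature, excluding ramping."""
    return cycles * sum(step_minutes)


primer_uM = micromolar(25, 25)              # 25 pmol of each primer in 25 µl
template = ng_per_ul(0.5, 25)               # 0.5 µg of template in 25 µl
run_time = cycling_minutes(35, (1, 1, 1))   # 35 cycles of three 1 min steps
```

With these figures each primer is present at 1 µM, the template at 20 ng/µl, and the programme takes 105 minutes of hold time.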

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include, but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well 
known in the art. For instance, a method similar to 5' RACE is available for 
generating the missing 5' end of a desired full-length transcript. (Fromont-Racine et 
al., Nucleic Acids Res. 21(7):1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the gene of interest is used to PCR amplify 
the 5' portion of the desired full-length gene. This amplified product may then be 
sequenced and used to generate the full length gene. 

The above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should 
then be inactivated and the RNA treated with tobacco acid pyrophosphatase in order 
to remove the cap structure present at the 5' ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific 
to the ligated RNA oligonucleotide and a primer specific to the known sequence of 
the gene of interest. The resultant product is then sequenced and analyzed to confirm 
that the 5' end sequence belongs to the desired gene. 

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide 

A human genomic P1 library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X, 
according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 
among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with ³²P using the rediprime™ DNA labeling 
system (Amersham Life Science), according to manufacturer's instructions. After 
labeling, the probe is purified using a CHROMA SPIN-100™ column (Clontech 
Laboratories, Inc.), according to manufacturer's protocol number PT1200-1. The 
purified labeled probe is then used to examine various human tissues for mRNA 
expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) 
or human immune system tissues (IM) (Clontech) are examined with the labeled 
probe using ExpressHyb™ hybridization solution (Clontech) according to 
manufacturer's protocol number PT1190-1. Following hybridization and washing, the 
blots are mounted and exposed to film at -70°C overnight, and the films developed 
according to standard procedures. 

Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO:X. This primer set preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions: 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is 
repeated 32 times followed by one 5 minute cycle at 70°C. Human, mouse, and 
hamster DNA is used as template in addition to a somatic cell hybrid panel containing 
individual chromosomes or chromosome fragments (Bios, Inc). The reactions are 
analyzed on either 8% polyacrylamide gels or 3.5% agarose gels. Chromosome 
mapping is determined by the presence of an approximately 100 bp PCR fragment in 
the particular somatic cell hybrid. 
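
The assignment logic can be sketched as a concordance check: the mapped chromosome is the one whose presence across the hybrid panel matches the pattern of PCR-positive hybrids. The Python sketch below is illustrative only; the function name `concordant_chromosomes` and the panel contents in the usage note are hypothetical and do not describe the actual panel.

```python
def concordant_chromosomes(panel: dict, pcr_positive: set) -> list:
    """Return chromosomes carried by exactly the PCR-positive hybrids.

    panel maps a hybrid name to the set of human chromosomes it retains;
    pcr_positive is the set of hybrids that gave the ~100 bp fragment.
    """
    all_chromosomes = set().union(*panel.values())
    hits = []
    for chromosome in all_chromosomes:
        carriers = {
            hybrid for hybrid, retained in panel.items()
            if chromosome in retained
        }
        # Perfect concordance: every carrier is positive, every non-carrier negative.
        if carriers == pcr_positive:
            hits.append(chromosome)
    return sorted(hits)
```

For a made-up panel `{"H1": {"1", "7"}, "H2": {"7", "X"}, "H3": {"1", "X"}}` in which hybrids H1 and H2 give the fragment, the function returns `["7"]`. A real analysis would also need to tolerate discordant hybrids and chromosome fragments.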

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and XbaI, at the 5' end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and XbaI correspond to the 
restriction enzyme sites on the bacterial expression vector pQE-9 (Qiagen, Inc., 
Chatsworth, CA). This plasmid vector encodes antibiotic resistance (Ampʳ), a 
bacterial origin of replication (ori), an IPTG-regulatable promoter/operator (P/O), a 
ribosome binding site (RBS), a 6-histidine tag (6-His), and restriction enzyme cloning 
sites. 

The pQE-9 vector is digested with BamHI and XbaI and the amplified 
fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at 
the bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which 
expresses the lacI repressor and also confers kanamycin resistance (Kanʳ). 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies are selected. Plasmid DNA is isolated and 
confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The 
cells are grown to an optical density at 600 nm (O.D.₆₀₀) of between 0.4 and 0.6. IPTG 
(isopropyl-β-D-thiogalactopyranoside) is then added to a final concentration of 1 
mM. IPTG induces expression by inactivating the lacI repressor, clearing the P/O and 
leading to increased gene expression. 
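
The inoculation and induction volumes follow from simple dilution arithmetic. The Python sketch below is illustrative only; the 1 litre culture volume and the 1 M IPTG stock are assumptions chosen for the example, and the function names are hypothetical.

```python
def inoculum_ml(culture_ml: float, dilution: float) -> float:
    """Volume of overnight culture for a 1:dilution inoculation."""
    return culture_ml / dilution


def stock_volume_ml(final_mM: float, culture_ml: float, stock_mM: float) -> float:
    """Stock volume from C1*V1 = C2*V2, neglecting the added volume."""
    return final_mM * culture_ml / stock_mM


culture = 1000.0                              # assumed 1 litre expression culture
low = inoculum_ml(culture, 250)               # 1:250 inoculation
high = inoculum_ml(culture, 100)              # 1:100 inoculation
iptg = stock_volume_ml(1.0, culture, 1000.0)  # assumed 1 M (1000 mM) IPTG stock
```

Under these assumptions a 1 litre culture takes 4 to 10 ml of overnight culture and 1 ml of IPTG stock.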

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000 x g). The cell pellet is solubilized in the chaotropic 
agent 6 M guanidine-HCl by stirring for 3-4 hours at 4°C. The cell debris is 
removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with 
high affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 
8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then 
washed with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is 
eluted with 6 M guanidine-HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the 
Ni-NTA column. The recommended conditions are as follows: renature using a linear 
6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 
mM imidazole. Imidazole is removed by a final dialyzing step against PBS or 50 
mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 
4°C or frozen at -80°C. 

In addition to the above expression vector, the present invention further 
includes an expression vector, called pHE4a, comprising phage operator and promoter 
elements operatively linked to a polynucleotide of the present invention. (ATCC 
Accession Number 209645, deposited on February 25, 1998.) This vector contains: 
1) a neomycin phosphotransferase gene as a selection marker, 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 
Shine-Dalgarno sequence, and 6) the lactose operon repressor gene (lacIq). The 
origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The 
promoter sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHE4a by restricting the vector with NdeI and 
XbaI, BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or 
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 6: Purification of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide 
expressed in E. coli when it is present in the form of inclusion bodies. Unless 
otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 
overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA 
by vigorous stirring. The refolded diluted protein solution is kept at 4°C without 
mixing for 12 hours prior to further purification steps. 
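
As a rough check on the refolding step, the residual GuHCl concentration after dilution can be computed as sketched below. The sketch assumes that "20 volumes" means one part extract mixed with twenty parts buffer, that the buffer itself contains no GuHCl, and that volumes are additive; the function name is illustrative only.

```python
def concentration_after_dilution(start_molar, volumes_of_diluent):
    """Concentration after mixing 1 volume of sample with N volumes of diluent."""
    if volumes_of_diluent < 0:
        raise ValueError("volumes_of_diluent must be non-negative")
    return start_molar / (1.0 + volumes_of_diluent)


# 1.5 M GuHCl extract mixed with 20 volumes of refolding buffer.
residual = concentration_after_dilution(1.5, 20)
print(f"Residual GuHCl: about {residual * 1000:.0f} mM")
```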

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with a 0.16 µm membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0, is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak 
anion (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are 
equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 
mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using 
a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are 
collected under constant monitoring of the effluent. Fractions containing the 
polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 
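
For planning fraction collection, the nominal buffer composition at any point in the 10 column volume gradient can be estimated by linear interpolation, as sketched below. The NaCl interpolation follows directly from the gradient limits stated above; treating the pH as varying linearly between 6.0 and 6.5 is a simplifying assumption, since the pH of a buffer mixture is not strictly linear in the mixing ratio.

```python
def gradient_at(column_volumes, total_cv=10.0,
                nacl_start=0.2, nacl_end=1.0, ph_start=6.0, ph_end=6.5):
    """Return (NaCl molarity, approximate pH) after a given number of column volumes."""
    if not 0 <= column_volumes <= total_cv:
        raise ValueError("column_volumes must lie within the gradient")
    fraction = column_volumes / total_cv
    nacl = nacl_start + fraction * (nacl_end - nacl_start)
    ph = ph_start + fraction * (ph_end - ph_start)  # linear pH is an assumption
    return nacl, ph


for cv in (0, 2.5, 5, 10):
    nacl, ph = gradient_at(cv)
    print(f"{cv:>4} CV: {nacl:.2f} M NaCl, pH ~{ph:.2f}")
```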

The resultant polypeptide should exhibit greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands should be 
observed from Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified 
protein is loaded. The purified protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL 
assays. 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a 
polynucleotide into a baculovirus to express a polypeptide. This expression vector 
contains the strong polyhedrin promoter of the Autographa californica nuclear 
polyhedrosis virus (AcMNPV) followed by convenient restriction sites such as 
BamHI, Xba I and Asp718. The polyadenylation site of the simian virus 40 ("SV40") 
is used for efficient polyadenylation. For easy selection of recombinant virus, the 
plasmid contains the beta-galactosidase gene from E. coli under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadenylation signal 
of the polyhedrin gene. The inserted genes are flanked on both sides by viral 
sequences for cell-mediated homologous recombination with wild-type viral DNA to 
generate a viable virus that expresses the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, 
as long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
required. Such vectors are described, for instance, in Luckow et al., Virology 170:31- 
39(1989). 

Specifically, the cDNA sequence contained in the deposited clone, including 
the AUG initiation codon and the naturally associated leader sequence identified in 
Table 1, is amplified using the PCR protocol described in Example 1. If the naturally 
occurring signal sequence is used to produce the secreted protein, the pA2 vector does 
not need a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in 
Summers et al., "A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures," Texas Agricultural Experimental Station Bulletin No. 1555 
(1987). 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified 
by digesting DNA from individual colonies and analyzing the digestion product by 
gel electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five µg of a plasmid containing the polynucleotide is co-transfected with 1.0 
µg of a commercially available linearized baculovirus DNA ("BaculoGold™ 
baculovirus DNA", Pharmingen, San Diego, CA), using the lipofection method 
described by Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One µg 
of BaculoGold™ virus DNA and 5 µg of the plasmid are mixed in a sterile well of a 
microtiter plate containing 50 µl of serum-free Grace's medium (Life Technologies 
Inc., Gaithersburg, MD). Afterwards, 10 µl Lipofectin plus 90 µl Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded 
in a 35 mm tissue culture plate with 1 ml Grace's medium without serum. The plate is 
then incubated for 5 hours at 27°C. The transfection solution is then removed from 
the plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. Cultivation is then continued at 27°C for four days. 

After four days the supernatant is collected and a plaque assay is performed, 
as described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of 
a "plaque assay" of this type can also be found in the user's guide for insect cell 
culture and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 
9-10.) After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 µl of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded 
in 35 mm dishes. Four days later the supernatants of these culture dishes are 
harvested and then they are stored at 4°C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 
infection ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the 
medium is removed and is replaced with SF900 II medium minus methionine and 
cysteine (available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 
µCi of 35S-methionine and 5 µCi 35S-cysteine (available from Amersham) are added. 
The cells are further incubated for 16 hours and then are harvested by centrifugation. 
The proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-PAGE followed by autoradiography (if radiolabeled). 
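
The volume of virus stock needed to infect at an MOI of about 2 can be estimated as sketched below. The cell number and the virus titer are hypothetical values that would in practice come from a cell count and a plaque assay. The estimate of the infected fraction assumes that virus particles distribute over the cells according to a Poisson distribution, which is a common approximation rather than anything stated in this example.

```python
import math


def inoculum_volume_ml(n_cells, moi, titer_pfu_per_ml):
    """Volume of virus stock (ml) that delivers moi plaque-forming units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * n_cells / titer_pfu_per_ml


def fraction_infected(moi):
    """Poisson estimate of the fraction of cells receiving at least one virus."""
    return 1.0 - math.exp(-moi)


# Hypothetical example: 1e6 Sf9 cells, stock titer 1e8 pfu/ml, MOI of 2.
volume = inoculum_volume_ml(1e6, 2, 1e8)
print(f"Add {volume * 1000:.0f} µl of virus stock")
print(f"Expected infected fraction: {fraction_infected(2):.1%}")
```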

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
produced protein. 

Example 8: Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a mammalian 
cell. A typical mammalian expression vector contains a promoter element, which 
mediates the initiation of transcription of mRNA, a protein coding sequence, and 
signals required for the termination of transcription and polyadenylation of the 
transcript. Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from retroviruses, e.g., RSV, HTLV-I, HIV-I, and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be 
used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
include human HeLa, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing 
the polynucleotide integrated into a chromosome. Co-transfection with a 
selectable marker such as dhfr, gpt, neomycin, or hygromycin allows the identification 
and isolation of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J. 
L. and Ma, C., Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker 
is the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 
(1991); Bebbington et al., Bio/Technology 10:169-175 (1992)). Using these markers, 
the mammalian cells are grown in selective medium and the cells with the highest 
resistance are selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985)). Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, XbaI and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphatase by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the 
protocol outlined in Example 1. If the naturally occurring signal sequence is used to 
produce the secreted protein, the vector does not need a second signal peptide. 
Alternatively, if the naturally occurring signal sequence is not used, the vector can be 
modified to include a heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five µg of the expression plasmid pC6 is cotransfected with 0.5 µg of 
the plasmid pSVneo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 
that confers resistance to a group of antibiotics including G418. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 
petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 
100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 100 - 
200 µM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 9: Protein Fusions 

The polypeptides of the present invention are preferably fused to other 
proteins. These fusion proteins can be used for a variety of applications. For 
example, fusion of the present polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 5; see 
also EP A 394,827; Traunecker, et al., Nature 331:84-86 (1988).) Similarly, fusion to 
IgG-1, IgG-3, and albumin increases the half-life in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention can target the protein to a 
specific subcellular localization, while covalent heterodimers or homodimers can 
increase or decrease the activity of a fusion protein. Fusion proteins can also create 
chimeric molecules having more than one function. Finally, fusion proteins can 
increase solubility and/or stability of the fused protein compared to the non-fused 
protein. All of the types of fusion proteins described above can be made by 
modifying the following protocol, which outlines the fusion of a polypeptide to an 
IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 
using primers that span the 5' and 3' ends of the sequence described below. These 
primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion 
can be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note 
that the polynucleotide is cloned without a stop codon, otherwise a fusion protein will 
not be produced. 
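
The requirement that the polynucleotide be cloned without a stop codon can be checked before ligation, for example as sketched below. The sketch assumes that the insert sequence is supplied starting at the first base of the initiation codon and reads straight through to the junction; it does not account for bases contributed by the restriction site or any linker, which must also keep the Fc portion in frame. The function names and the short test sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame_stops(cds):
    """Return the 0-based positions of stop codons in reading frame 0."""
    cds = cds.upper()
    return [i for i in range(0, len(cds) - 2, 3) if cds[i:i + 3] in STOP_CODONS]


def ready_for_c_terminal_fusion(cds):
    """True if the insert is a whole number of codons and has no in-frame stop."""
    return len(cds) % 3 == 0 and not in_frame_stops(cds)


# Hypothetical inserts: the first still carries its stop codon, the second does not.
print(ready_for_c_terminal_fusion("ATGGCCTAA"))  # insert ends in TAA
print(ready_for_c_terminal_fusion("ATGGCC"))     # stop codon removed
```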

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGC 
CCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAA 
CCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGT 
GGTGGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGG 
ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTA 
CAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACT 
GGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCA 
ACCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 
CACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAG 
GTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGT 
GGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCT 
CCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTG 
GACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCA 
TGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 
GTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID NO:1) 

Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing a 
polypeptide of the present invention are administered to an animal to induce the 
production of sera containing polyclonal antibodies. In a preferred method, a 
preparation of the secreted protein is prepared and purified to render it substantially 
free of natural contaminants. Such a preparation is then introduced into an animal in 
order to produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
about 56°C), and supplemented with about 10 g/l of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 µg/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981)). The hybridoma 
cells obtained through such a selection are then assayed to identify clones which 
secrete antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
whose ability to bind to the protein-specific antibody can be blocked by the 
polypeptide. Such antibodies comprise anti-idiotypic antibodies to the protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods disclosed 
herein. Such fragments are typically produced by proteolytic cleavage, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 
fragments). Alternatively, secreted protein-binding fragments can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are known in 
the art. (See, for review, Morrison, Science 229:1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et 
al., EP 171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson 
et al., WO 8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Nature 
314:268 (1985).) 

Example 11: Production of Secreted Protein for High-Throughput Screening 
Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(1 mg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for 
a working solution of 50 ug/ml. Add 200 ul of this solution to each well (24 well 
plates) and incubate at RT for 20 minutes. Be sure to distribute the solution over each 
well (note: a 12-channel pipetter may be used with tips on every other channel). 
Aspirate off the Poly-D-Lysine solution and rinse with 1 ml PBS (Phosphate Buffered 
Saline). The PBS should remain in the well until just prior to plating the cells and 
plates may be poly-lysine coated in advance for up to two weeks. 
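
The coating volumes for one 24-well plate can be worked out as sketched below. The sketch interprets "1:20" as one part stock in twenty parts total, which is consistent with the stated 50 ug/ml working concentration; it includes no allowance for pipetting losses, and the function name is illustrative only.

```python
def working_solution(stock_ug_per_ml, dilution_factor, wells, ul_per_well):
    """Return (working conc in ug/ml, total ul, stock ul, diluent ul) for a coating step."""
    working_conc = stock_ug_per_ml / dilution_factor
    total_ul = wells * ul_per_well
    stock_ul = total_ul / dilution_factor  # 1 part stock in dilution_factor parts total
    return working_conc, total_ul, stock_ul, total_ul - stock_ul


# 1 mg/ml stock diluted 1:20, 200 ul per well, one 24-well plate.
conc, total, stock, pbs = working_solution(1000.0, 20, 24, 200)
print(f"{conc:.0f} ug/ml working solution: {stock:.0f} ul stock + {pbs:.0f} ul PBS = {total} ul")
```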

Plate 293T cells (do not carry cells past P+20) at 2 x 10^5 cells/well in 0.5 ml 
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x 
Penstrep (17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5 ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2 ug of an 
expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 8 or 9, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter, add 50 ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 15-45 
minutes. After about 20 minutes, use a multi-channel pipetter to add 150 ul Optimem 
I to each well. As a control, one plate of vector DNA lacking an insert should be 
transfected with each set of transfections. 
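
The volumes in the transfection mixture can be cross-checked as sketched below. The figures are those given above; the sketch ignores the small volume contributed by the DNA itself, and the dictionary keys are names chosen for this illustration.

```python
LIPOFECTAMINE_UL = 300   # per 96-well plate
OPTIMEM_UL = 5000        # 5 ml Optimem I per 96-well plate
WELLS = 96
MIX_PER_WELL_UL = 50
OPTIMEM_TOPUP_UL = 150


def master_mix_budget():
    """Summarize master-mix volumes for one 96-well plate of transfection complexes."""
    prepared = LIPOFECTAMINE_UL + OPTIMEM_UL
    needed = WELLS * MIX_PER_WELL_UL
    return {
        "prepared_ul": prepared,
        "needed_ul": needed,
        "excess_ul": prepared - needed,
        # Lipofectamine delivered to each well with the 50 ul aliquot.
        "lipofectamine_per_well_ul": MIX_PER_WELL_UL * LIPOFECTAMINE_UL / prepared,
        # Complex volume per well after the Optimem I top-up (DNA volume ignored).
        "complex_per_well_ul": MIX_PER_WELL_UL + OPTIMEM_TOPUP_UL,
    }


for key, value in master_mix_budget().items():
    print(f"{key}: {value:.2f}")
```

The 200 ul complex volume per well agrees with the volume transferred to the cells in the following step.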

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates 
of cells, and then person B rinses each well with 0.5-1 ml PBS. Person A then aspirates 
off PBS rinse, and person B, using a 12-channel pipetter with tips on every other 
channel, adds the 200 ul of DNA/Lipofectamine/Optimem I complex to the odd wells 
first, then to the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 
hours. 

While cells are incubating, prepare appropriate media, either 1% BSA in 
DMEM with 1x penstrep, or CHO-5 media (116.6 mg/L of CaCl2 (anhyd); 0.00130 
mg/L CuSO4-5H2O; 0.050 mg/L of Fe(NO3)3-9H2O; 0.417 mg/L of FeSO4-7H2O; 
311.80 mg/L of KCl; 28.64 mg/L of MgCl2; 48.84 mg/L of MgSO4; 6995.50 mg/L of 
NaCl; 2400.0 mg/L of NaHCO3; 62.50 mg/L of NaH2PO4-H2O; 71.02 mg/L of 
Na2HPO4; .4320 mg/L of ZnSO4-7H2O; .002 mg/L of Arachidonic Acid; 1.022 mg/L 
of Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic 
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of 
Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of 
D-Glucose; 130.85 mg/ml of L-Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-H2O; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of 
L-Cystine-2HCL-H2O; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 
365.0 mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of 
L-Histidine-HCL-H2O; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine; 
163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of 
L-Phenylalanine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of 
L-Threonine; 19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tyrosine-2Na-2H2O; 
99.65 mg/ml of L-Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 
11.78 mg/L of Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 
3.02 mg/L of Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 
0.319 mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B12; 25 mM of HEPES Buffer; 2.39 mg/L of Na 
Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 
55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20 uM of 
Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed 
with Oleic Acid; and 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal) 
with 2 mM glutamine and 1x penstrep. (BSA (81-068-3 Bayer) 100 gm dissolved in 1 L 
DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15 ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end 
of the incubation period. Person A aspirates off the transfection media, while person 
B adds 1.5 ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1% BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300 ul multichannel pipetter, aliquot 600 ul in one 1 ml 
deep well plate and the remaining supernatant into a 2 ml deep well. The supernatants 
from each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the 
polypeptide directly (e.g., as a secreted protein) or by the polypeptide inducing 
expression of other proteins, which are then secreted into the supernatant. Thus, the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay. 

Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
Jaks-STATs pathway bind to gamma activation site ("GAS") elements or 
interferon-sensitive responsive element ("ISRE"), located in the promoter of many genes. The 
binding of a protein to these elements alters the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Stat1 and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I cells after treatment with 
IL-12. Stat5 was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in 
tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 
upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jak1, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 
64:621-51 (1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-11, 
IL-12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) 
Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a conserved 
cysteine motif (a set of four conserved cysteines and one tryptophan) and a WSXWS 
motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines 
are known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using 
GAS elements linked to reporter molecules, activators of the Jaks-STATs pathway 
can be identified. 
[Table residue: Jak and STAT activation by ligand. The column layout and several
row labels were lost in extraction; surviving entries are listed in source order.]

(ligand label lost): +, -, -; 1,2,3
IFN-g: +, +, -; 1; GAS (IRF1>Lys6>IFP)
IL-10 and following rows (ligand labels lost): +, ?, +, ?, ?, +, +, ?, -/+, +, +, ?, ?, ?, +
EGF; PDGF: ?, +
CSF-1: ?, +; GAS (not IRF1)

To construct a synthetic GAS-containing promoter element, which is used in
the Biological Assays described in Examples 13-14, a PCR-based strategy is
employed to generate a GAS-SV40 promoter sequence. The 5' primer contains four
tandem copies of the GAS binding site found in the IRF1 promoter and previously
demonstrated to bind STATs upon induction with a range of cytokines (Rothman et
al., Immunity 1:457-468 (1994)), although other GAS or ISRE elements can be used
instead. The 5' primer also contains 18 bp of sequence complementary to the SV40
early promoter sequence and is flanked with an XhoI site. The sequence of the 5'
primer is:

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCC
GAAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3)

The downstream primer is complementary to the SV40 promoter and is
flanked with a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3'
(SEQ ID NO:4)
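
The primer features described above (restriction site, tandem repeats) can be checked with a few string operations. The following is a minimal sketch, not part of the original protocol: the primer strings are copied from SEQ ID NO:3 and SEQ ID NO:4 with extraction spacing removed, and the 13-base repeat unit `GAS_REPEAT` is an assumption read off the primer itself rather than a value stated in the text.

```python
# Primer sequences as printed above, with OCR spacing removed.
GAS_5P_PRIMER = (
    "GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCC"
    "GAAATGATTTCCCCGAAATATCTGCCATCTCAATTAG"
)
SV40_3P_PRIMER = "GCGGCAAGCTTTTTGCAAAGCCTAGGC"

XHOI_SITE = "CTCGAG"     # XhoI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
# Assumption: the repeated unit is the 13-mer visible in the primer.
GAS_REPEAT = "ATTTCCCCGAAAT"


def summarize_primer(primer, site, repeat=None):
    """Report primer length, 0-based offset of the restriction site,
    and the number of non-overlapping copies of an optional repeat."""
    return {
        "length": len(primer),
        "site_offset": primer.find(site),
        "repeat_copies": primer.count(repeat) if repeat else 0,
    }


upstream = summarize_primer(GAS_5P_PRIMER, XHOI_SITE, GAS_REPEAT)
downstream = summarize_primer(SV40_3P_PRIMER, HINDIII_SITE)
print(upstream)
print(downstream)
```

A check of this kind is mainly useful for catching transcription errors before a primer is ordered; it says nothing about annealing behavior.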

PCR amplification is performed using the SV40 promoter template present in
the B-gal promoter plasmid obtained from Clontech. The resulting PCR fragment is
digested with XhoI/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing
with forward and reverse primers confirms that the insert contains the following
sequence:

5':CTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCC
TGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCG 
CCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 
CCGCCCCATGGCTGACTAATTTTTTTTA 

TCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCT 
AGGCTTTTGCAAAAAGCTT:3' (SEQ ID NO:5)

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2
reporter construct is next engineered. Here, the reporter molecule is a secreted
alkaline phosphatase, or "SEAP." Clearly, however, any reporter molecule can be
used instead of SEAP, in this or in any of the other Examples. Well known reporter
molecules that can be used instead of SEAP include chloramphenicol
acetyltransferase (CAT), luciferase, alkaline phosphatase, B-galactosidase, green
fluorescent protein (GFP), or any protein detectable by an antibody.

The above sequence-confirmed synthetic GAS-SV40 promoter element is
subcloned into the pSEAP-Promoter vector obtained from Clontech using HindIII and
XhoI, effectively replacing the SV40 promoter with the amplified GAS:SV40
promoter element, to create the GAS-SEAP vector. However, this vector does not
contain a neomycin resistance gene, and therefore, is not preferred for mammalian
expression systems.

Thus, in order to generate mammalian stable cell lines expressing the GAS-SEAP
reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using
SalI and NotI, and inserted into a backbone vector containing the neomycin resistance
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into
mammalian cells, it can then be used as a reporter molecule for GAS binding
as described in Examples 13-14.

Other constructs can be made using the above description and replacing GAS
with a different promoter sequence. For example, construction of reporter molecules
containing NF-KB and EGR promoter sequences is described in Examples 15 and
16. However, many other promoters can be substituted using the protocols described
in these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB,
IL-2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell),
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte.

Example 13: High-Throughput Screening Assay for T-cell Activity. 

The following protocol is used to assess T-cell activity by identifying factors,
such as growth factors and cytokines, that may proliferate or differentiate T-cells.
T-cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12.
Thus, factors that increase SEAP activity indicate the ability to activate the
Jaks-STATs signal transduction pathway. The T-cells used in this assay are Jurkat T-cells
(ATCC Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552)
and Molt-4 cells (ATCC Accession No. CRL-1582) can also be used.

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate
stable cell lines, approximately 2 million Jurkat cells are transfected with the
GAS-SEAP/neo vector using DMRIE-C (Life Technologies) (transfection procedure
described below). The transfected cells are seeded to a density of approximately
20,000 cells per well and transfectants resistant to 1 mg/ml Geneticin are selected.
Resistant colonies are expanded and then tested for their response to increasing
concentrations of interferon gamma. The dose response of a selected clone is
demonstrated.

Specifically, the following protocol will yield sufficient cells for 75 wells
containing 200 ul of cells. Thus, it is either scaled up or performed in multiple to
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in
RPMI + 10% serum with 1% Pen-Strep. Combine 2.5 ml of OPTI-MEM (Life
Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM
containing 50 ul of DMRIE-C and incubate at room temperature for 15-45 min.

During the incubation period, count cell concentration, spin down the required
number of cells (10^7 per transfection), and resuspend in OPTI-MEM to a final
concentration of 10^7 cells/ml. Then add 1 ml of 1 x 10^7 cells in OPTI-MEM to the T25
flask and incubate at 37°C for 6 hr. After the incubation, add 10 ml of RPMI + 15%
serum.

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10%
serum, 1 mg/ml Geneticin, and 1% Pen-Strep. These cells are treated with
supernatants containing a polypeptide as produced by the protocol described in
Example 11.

On the day of treatment with the supernatant, the cells should be washed and
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The
exact number of cells required will depend on the number of supernatants being
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100
million cells) are required.

Transfer the cells to a triangular reservoir boat, in order to dispense the cells
into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer
200 ul of cells into each well (therefore adding 100,000 cells per well).
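
The seeding figures above follow from simple arithmetic, sketched here in Python. The function names are illustrative only, and the 96-well default is taken from the plate format named in the text.

```python
def cells_per_well(density_per_ml, volume_ul):
    """Cells delivered to one well from a suspension of the given density."""
    return density_per_ml * volume_ul / 1000.0


def cells_for_plates(density_per_ml, volume_ul, n_plates, wells_per_plate=96):
    """Total cells needed to fill every well of n_plates plates."""
    return cells_per_well(density_per_ml, volume_ul) * wells_per_plate * n_plates


# Values from the protocol: 500,000 cells/ml, 200 ul per well.
per_well = cells_per_well(500_000, 200)
one_plate = cells_for_plates(500_000, 200, 1)
ten_plates = cells_for_plates(500_000, 200, 10)
print(per_well, one_plate, ten_plates)
```

The exact requirement for one plate comes to 9.6 million cells, which the protocol rounds to approximately 10 million, leaving a small margin for pipetting loss.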

After all the plates have been seeded, 50 ul of the supernatants are transferred
directly from the 96 well plate containing the supernatants into each well using a 12
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng)
is added to wells H9, H10, and H11 to serve as additional positive controls for the
assay.

The 96 well dishes containing Jurkat cells treated with supernatants are placed
in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul
samples from each well are then transferred to an opaque 96 well plate using a 12
channel pipette. The opaque plates should be covered (using cellophane covers) and
stored at -20°C until SEAP assays are performed according to Example 17. The
plates containing the remaining treated cells are placed at 4°C and serve as a source
of material for repeating the assay on a specific well if desired.

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 

The above protocol may be used to generate both transiently and stably
transfected cells, as would be apparent to those of skill in the art.

Example 14: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity by identifying 
factors, such as growth factors and cytokines, that may proliferate or differentiate 
myeloid cells. Myeloid cell activity is assessed using the GAS/SEAP/Neo construct 
produced in Example 12. Thus, factors that increase SEAP activity indicate the
ability to activate the Jaks-STATs signal transduction pathway. The myeloid cell
used in this assay is U937, a pre-monocyte cell line, although TF-1, HL60, or KG1 
can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct
produced in Example 12, a DEAE-Dextran method (Kharbanda et al., 1994, Cell
Growth & Differentiation, 5:259-265) is used. First, harvest 2 x 10^7 U937 cells and
wash with PBS. The U937 cells are usually grown in RPMI 1640 medium containing
10% heat-inactivated fetal bovine serum (FBS) supplemented with 100 units/ml
penicillin and 100 ug/ml streptomycin.

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing
0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM
KCl, 375 uM Na2HPO4.7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37°C
for 45 min.

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400
ug/ml G418. G418-free medium is used for routine growth, but every one to two
months the cells should be re-grown in 400 ug/ml G418 for a couple of passages.

These cells are tested by harvesting 1 x 10^8 cells (enough for ten 96-well
plate assays) and washing with PBS. Suspend the cells in 200 ml of the above-described
growth medium, for a final density of 5 x 10^5 cells/ml. Plate 200 ul of cells per well in
the 96-well plate (or 1 x 10^5 cells/well).

Add 50 ul of the supernatant prepared by the protocol described in Example 
11. Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant 
according to the protocol described in Example 17. 

Example 15: High-Throughput Screening Assay Identifying Neuronal Activity

When cells undergo differentiation and proliferation, a group of genes is
activated through many different signal transduction pathways. One of these genes,
EGR1 (early growth response gene 1), is induced in various tissues and cell types
upon activation. The promoter of EGR1 is responsible for such induction. Using the
EGR1 promoter linked to reporter molecules, activation of cells can be assessed.

Particularly, the following protocol is used to assess neuronal activity in PC12
cell lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor).
EGR1 gene expression is activated during this treatment. Thus, by stably
transfecting PC12 cells with a construct containing an EGR promoter linked to a SEAP
reporter, activation of PC12 cells can be assessed.

The EGR/SEAP reporter construct can be assembled by the following
protocol. The EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene
6:867-871 (1991)) can be PCR amplified from human genomic DNA using the
following primers:

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG-3' (SEQ ID NO:6)
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7)

Using the GAS:SEAP/Neo vector produced in Example 12, the EGR1 amplified
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector
using restriction enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1
promoter.

To prepare 96-well plates for cell culture, two ml of a coating solution (1:30
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter
sterilized)) is added per 10 cm plate, or 50 ul per well of the 96-well plate, and
allowed to air dry for 2 hr.

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker)
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5%
heat-inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and
100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. A one to four split is
done every three to four days. Cells are removed from the plates by scraping and
resuspended by pipetting up and down more than 15 times.

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by
growing the cells in 300 ug/ml G418. G418-free medium is used for routine
growth, but every one to two months the cells should be re-grown in 300 ug/ml G418
for a couple of passages.

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80%
confluent is screened by removing the old medium. Wash the cells once with PBS
(phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640
containing 1% horse serum and 0.5% FBS with antibiotics) overnight.

The next morning, remove the medium and wash the cells with PBS. Scrape
the cells off the plate and suspend them well in 2 ml low serum medium. Count
the cell number and add more low serum medium to reach a final cell density of
5 x 10^5 cells/ml.

Add 200 ul of the cell suspension to each well of a 96-well plate (equivalent to
1 x 10^5 cells/well). Add 50 ul of supernatant produced by Example 11 and incubate at
37°C for 48 to 72 hr. As a positive control, a growth factor known to activate PC12
cells through EGR can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF).
Over fifty-fold induction of SEAP is typically seen in the positive control wells. SEAP
assay the supernatant according to Example 17.

Example 16: High-Throughput Screening Assay for T-cell Activity

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide
variety of agents including the inflammatory cytokines IL-1 and TNF, CD30 and
CD40, lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin,
and by expression of certain viral gene products. As a transcription factor, NF-kB
regulates the expression of genes involved in immune cell activation, control of
apoptosis (NF-kB appears to shield cells from apoptosis), B and T-cell development,
anti-viral and antimicrobial responses, and multiple stress responses.

In non-stimulated conditions, NF-kB is retained in the cytoplasm with I-kB
(Inhibitor kB). However, upon stimulation, I-kB is phosphorylated and degraded,
causing NF-kB to shuttle to the nucleus, thereby activating transcription of target
genes. Target genes activated by NF-kB include IL-2, IL-6, GM-CSF, ICAM-1 and
class 1 MHC.

Due to its central role and ability to respond to a range of stimuli, reporter
constructs utilizing the NF-kB promoter element are used to screen the supernatants
produced in Example 11. Activators or inhibitors of NF-kB would be useful in
treating diseases. For example, inhibitors of NF-kB could be used to treat those
diseases related to the acute or chronic activation of NF-kB, such as rheumatoid
arthritis.

To construct a vector containing the NF-kB promoter element, a PCR-based
strategy is employed. The upstream primer contains four tandem copies of the NF-kB
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary
to the 5' end of the SV40 early promoter sequence, and is flanked with an XhoI site:
5':GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC
TTTCCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:9)

The downstream primer is complementary to the 3' end of the SV40 promoter
and is flanked with a Hind III site:
5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4)

PCR amplification is performed using the SV40 promoter template present in
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is
digested with XhoI and Hind III and subcloned into BLSK2-. (Stratagene)
Sequencing with the T7 and T3 primers confirms the insert contains the following
sequence:

5':CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCC
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCC
ATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGA
CTAATTTTTTTTATTTATGCAGAGGCCGA

TTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAG
GCTT:3' (SEQ ID NO:10)

Next, replace the SV40 minimal promoter element present in the
pSEAP2-promoter plasmid (Clontech) with this NF-KB/SV40 fragment using XhoI and
HindIII. However, this vector does not contain a neomycin resistance gene, and
therefore, is not preferred for mammalian expression systems.

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes
SalI and NotI, and inserted into a vector containing neomycin resistance. Particularly,
the NF-KB/SV40/SEAP cassette is inserted into pGFP-1 (Clontech), replacing the
GFP gene, after restricting pGFP-1 with SalI and NotI.

Once the NF-kB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are
created and maintained according to the protocol described in Example 13. Similarly,
the method for assaying supernatants with these stable Jurkat T-cells is also described
in Example 13. As a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to
wells H9, H10, and H11, with a 5-10 fold activation typically observed.

Example 17: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 13-16, SEAP
activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the
following general procedure. The Tropix Phospho-light Kit supplies the Dilution,
Assay, and Reaction Buffers used below.

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with
a plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid
uneven heating.

Cool the samples to room temperature for 15 minutes. Empty the dispenser
and prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room
temperature for 5 min. Empty the dispenser and prime with the Reaction Buffer (see the
table below). Add 50 ul Reaction Buffer and incubate at room temperature for 20
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it
takes about 10 minutes to read 5 plates on the luminometer, one should treat 5 plates at
a time and start the second set 10 minutes later.

Read the relative light units in the luminometer. Set H12 as blank, and print
the results. An increase in chemiluminescence indicates reporter activity.
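
The fold-induction figures quoted for the positive controls in Examples 13-16 can be computed from blank-corrected luminometer readings. The sketch below is one plausible calculation, not a procedure given in the source: it assumes an untreated well as the baseline and subtracts the H12 blank from both readings, and the example readings are invented for illustration.

```python
def fold_induction(sample_rlu, baseline_rlu, blank_rlu=0.0):
    """Ratio of blank-corrected sample signal to blank-corrected baseline.

    Assumption: baseline_rlu comes from an untreated control well and
    blank_rlu from the well set as blank (H12 in the procedure above).
    """
    corrected_baseline = baseline_rlu - blank_rlu
    if corrected_baseline <= 0:
        raise ValueError("baseline must exceed the blank reading")
    return (sample_rlu - blank_rlu) / corrected_baseline


# Hypothetical readings in relative light units, for illustration only.
example = fold_induction(sample_rlu=6200, baseline_rlu=300, blank_rlu=100)
print(example)
```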

Reaction Buffer Formulation:

# of plates    Rxn buffer diluent (ml)    CSPD (ml)
10             60                         3
11             65                         3.25
12             70                         3.5
13             75                         3.75
14             80                         4
15             85                         4.25
16             90                         4.5
17             95                         4.75
18             100                        5
19             105                        5.25
20             110                        5.5
21             115                        5.75
22             120                        6
23             125                        6.25
24             130                        6.5
25             135                        6.75
26             140                        7
27             145                        7.25
28             150                        7.5
29             155                        7.75
30             160                        8
31             165                        8.25
32             170                        8.5
33             175                        8.75
34             180                        9
35             185                        9.25
36             190                        9.5
37             195                        9.75
38             200                        10
39             205                        10.25
40             210                        10.5
41             215                        10.75
42             220                        11
43             225                        11.25
44             230                        11.5
45             235                        11.75
46             240                        12
47             245                        12.25
48             250                        12.5
49             255                        12.75
50             260                        13
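
The Reaction Buffer Formulation entries follow a linear pattern: 60 ml of diluent for 10 plates plus 5 ml for each additional plate, with CSPD at one twentieth of the diluent volume. A short sketch of that relationship follows; the function name is illustrative, and the 10-50 plate range is simply the range the table covers.

```python
def reaction_buffer_volumes(n_plates):
    """Return (diluent_ml, cspd_ml) for a given number of plates.

    Derived from the table pattern: diluent = 60 + 5 * (n - 10) ml,
    CSPD = diluent / 20. Valid only over the tabulated range.
    """
    if not 10 <= n_plates <= 50:
        raise ValueError("table covers 10 to 50 plates")
    diluent_ml = 60 + 5 * (n_plates - 10)
    cspd_ml = diluent_ml / 20
    return diluent_ml, cspd_ml


for n in (10, 11, 39, 50):
    print(n, reaction_buffer_volumes(n))
```

Extrapolating beyond the tabulated range is not supported by the source, which is why the sketch rejects such inputs rather than computing a value.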



Example 18: High-Throughput Screening Assay Identifying Changes in Small
Molecule Concentration and Membrane Permeability

Binding of a ligand to a receptor is known to alter intracellular levels of small
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane
potential. These alterations can be measured in an assay to identify supernatants
which bind to receptors of a particular cell. Although the following protocol
describes an assay for calcium, this protocol can easily be modified to detect changes
in potassium, sodium, pH, membrane potential, or any other small molecule which is
detectable by a fluorescent probe.

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to
measure changes in fluorescent molecules (Molecular Probes) that bind small
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used
instead of the calcium fluorescent molecule, fluo-4 (Molecular Probes, Inc.;
catalog no. F-14202), used here.

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star
black 96-well plate with clear bottom. The plate is incubated in a CO2 incubator for
20 hours. The adherent cells are washed two times in a Biotek washer with 200 ul of
HBSS (Hank's Balanced Salt Solution), leaving 100 ul of buffer after the final wash.

A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To
load the cells with fluo-4, 50 ul of 12 ug/ml fluo-4 is added to each well. The plate
is incubated at 37°C in a CO2 incubator for 60 min. The plate is washed four times in
the Biotek washer with HBSS, leaving 100 ul of buffer.

For non-adherent cells, the cells are spun down from culture media. Cells are
re-suspended to 2-5 x 10^6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml
fluo-4 solution in 10% pluronic acid DMSO is added to each ml of cell suspension.
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed
twice with HBSS, resuspended to 1 x 10^6 cells/ml, and dispensed into a microplate, 100
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed
once in a Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final
volume.
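
The two loading procedures arrive at nearly the same working dye concentration, which a quick dilution calculation makes explicit. This is a sketch only; for the adherent case it assumes the 50 ul of dye is added to the 100 ul of buffer left in the well after the final wash.

```python
def final_concentration(stock_conc, stock_volume, diluent_volume):
    """Concentration after mixing a stock into a diluent (same volume units)."""
    return stock_conc * stock_volume / (stock_volume + diluent_volume)


# Adherent cells: 50 ul of 12 ug/ml fluo-4 into 100 ul of residual buffer.
adherent_ug_per_ml = final_concentration(12.0, 50.0, 100.0)

# Non-adherent cells: 4 ul of 1 mg/ml (1000 ug/ml) stock per 1000 ul of cells.
suspension_ug_per_ml = final_concentration(1000.0, 4.0, 1000.0)

print(adherent_ug_per_ml, round(suspension_ug_per_ml, 2))
```

Both routes give a loading concentration of about 4 ug/ml.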

For a non-cell based assay, each well contains a fluorescent molecule, such as
fluo-4. The supernatant is added to the well, and a change in fluorescence is
detected.

To measure the fluorescence of intracellular calcium, the FLIPR is set for the
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm;
and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an
extracellular signaling event which has resulted in an increase in the intracellular
Ca++ concentration.

Example 19: High-Throughput Screening Assay Identifying Tyrosine Kinase
Activity

The Protein Tyrosine Kinases (PTK) represent a diverse group of
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine
Kinase (RPTK) group are receptors for a range of mitogenic and metabolic growth
factors including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies.
In addition there is a large family of RPTKs for which the corresponding ligand is
unknown. Ligands for RPTKs include mainly secreted small proteins, but also
membrane-bound and extracellular matrix proteins.

Activation of RPTK by ligands involves ligand-mediated receptor
dimerization, resulting in transphosphorylation of the receptor subunits and activation
of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include
receptor associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family,
members of which mediate signal transduction triggered by the cytokine superfamily
of receptors (e.g., the Interleukins, Interferons, GM-CSF, and Leptin).

Because of the wide range of known factors capable of stimulating tyrosine
kinase activity, the identification of novel human secreted proteins capable of
activating tyrosine kinase signal transduction pathways is of interest. Therefore, the
following protocol is designed to identify those novel human secreted proteins
capable of activating the tyrosine kinase signal transduction pathways.

Seed target cells (e.g., primary keratinocytes) at a density of approximately
25,000 cells per well in 96 well Loprodyne Silent Screen Plates purchased from
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr
with 100 ul of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or
polylysine (50 mg/ml), all of which can be purchased from Sigma Chemicals (St.
Louis, MO), or 10% Matrigel purchased from Becton Dickinson (Bedford, MA), or
calf serum, rinsed with PBS and stored at 4°C. Cell growth on these plates is assayed
by seeding 5,000 cells/well in growth medium and indirect quantitation of cell
number through use of alamarBlue as described by the manufacturer Alamar
Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers #3071 from
Becton Dickinson (Bedford, MA) are used to cover the Loprodyne Silent Screen
Plates. Falcon Microtest III cell culture plates can also be used in some proliferation
experiments.

To prepare extracts, A431 cells are seeded onto the nylon membranes of
Loprodyne plates (20,000/200 ul/well) and cultured overnight in complete medium.
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20
minutes treatment with EGF (60 ng/ml) or 50 ul of the supernatant produced in
Example 11, the medium is removed and 100 ul of extraction buffer (20 mM
HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3VO4, 2 mM
Na4P2O7 and a cocktail of protease inhibitors (# 1836170) obtained from
Boehringer Mannheim (Indianapolis, IN)) is added to each well and the plate is
shaken on a rotating shaker for 5 minutes at 4°C. The plate is then placed in a
vacuum transfer manifold and the extract filtered through the 0.45 um membrane
bottoms of each well using house vacuum. Extracts are collected in a 96-well
catch/assay plate in the bottom of the vacuum manifold and immediately placed on
ice. To obtain extracts clarified by centrifugation, the content of each well, after
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at
4°C at 16,000 x g.

Test the filtered extracts for levels of tyrosine kinase activity. Although many
methods of detecting tyrosine kinase activity are known, one method is described
here.

Generally, the tyrosine kinase activity of a supernatant is evaluated by
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a
biotinylated peptide). Biotinylated peptides that can be used for this purpose include
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for
a range of tyrosine kinases and are available from Boehringer Mannheim.

The tyrosine kinase reaction is set up by adding the following components in
order. First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM
ATP/50 mM MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride,
pH 7.3, 40 mM beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2,
0.5 mg/ml BSA), then 5 ul of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the
components gently and preincubate the reaction mix at 30°C for 2 min. Initiate the
reaction by adding 10 ul of the control enzyme or the filtered supernatant.

The tyrosine kinase assay reaction is then terminated by adding 10 ul of
120 mM EDTA and placing the reactions on ice.
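
The component volumes above can be tallied to confirm the reaction volume and the working concentrations. The sketch below assumes the stated volumes are additive; the dictionary keys are descriptive labels chosen here, not names from the source.

```python
# Volumes in ul, in the order of addition given in the protocol.
components_ul = {
    "biotinylated peptide (5 uM)": 10,
    "ATP/Mg2+ (5 mM ATP)": 10,
    "5x assay buffer": 10,
    "sodium vanadate (1 mM)": 5,
    "water": 5,
    "enzyme or filtered supernatant": 10,
}


def diluted(stock, added_ul, total_ul):
    """Concentration of a stock after dilution into the total volume."""
    return stock * added_ul / total_ul


reaction_ul = sum(components_ul.values())
peptide_um = diluted(5.0, 10, reaction_ul)   # uM peptide in the reaction
buffer_x = diluted(5.0, 10, reaction_ul)     # fold-strength of assay buffer
atp_mm = diluted(5.0, 10, reaction_ul)       # mM ATP in the reaction

stopped_ul = reaction_ul + 10                # after adding 10 ul EDTA
edta_mm = diluted(120.0, 10, stopped_ul)     # mM EDTA in the stopped mix

print(reaction_ul, peptide_um, buffer_x, atp_mm, stopped_ul, edta_mm)
```

The 50 ul reaction volume brings the 5x Assay Buffer to 1x, and the stop step leaves EDTA at 20 mM in a 60 ul mixture.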

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of reaction
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This
allows the streptavidin coated 96 well plate to associate with the biotinylated peptide.
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of
anti-phosphotyrosine antibody conjugated to horseradish peroxidase (anti-P-Tyr-POD
(0.5 u/ml)) to each well and incubate at 37°C for one hour. Wash the well as
above.

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and
incubate at room temperature for at least 5 min (up to 30 min). Measure the
absorbance of the sample at 405 nm by using an ELISA reader. The level of bound
peroxidase activity is quantitated using an ELISA reader and reflects the level of
tyrosine kinase activity.

Example 20: High-Throughput Screening Assay Identifying Phosphorylation
Activity

As a potential alternative and/or complement to the assay of protein tyrosine
kinase activity described in Example 19, an assay which detects activation
(phosphorylation) of major intracellular signal transduction intermediates can also be
used. For example, as described below one particular assay can detect tyrosine
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase,
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by
substituting these molecules for Erk-1 or Erk-2 in the following assay.

Specifically, assay plates are made by coating the wells of a 96-well ELISA
plate with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp. (RT). The plates are
then rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G
plates are then treated with 2 commercial monoclonal antibodies (100 ng/well) against
Erk-1 and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this
step can easily be modified by substituting a monoclonal antibody detecting any of
the above described molecules.) After 3-5 rinses with PBS, the plates are stored at
4°C until use.

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and
cultured overnight in growth medium. The cells are then starved for 48 hr in basal
medium (DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants
obtained in Example 11 for 5-20 minutes. The cells are then solubilized and extracts
filtered directly into the assay plate.

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As
a positive control, a commercial preparation of MAP kinase (10 ng/well) is used in
place of A431 extract. Plates are then treated with a commercial polyclonal (rabbit)
antibody (1 µg/ml) which specifically recognizes the phosphorylated epitope of the
Erk-1 and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard
procedures. The bound polyclonal antibody is then quantitated by successive
incubations with Europium-streptavidin and Europium fluorescence enhancing
reagent in the Wallac DELFIA instrument (time-resolved fluorescence). An increased
fluorescent signal over background indicates phosphorylation.
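
The readout described above can be sketched as a simple fold-over-background calculation. This is an illustration only: the function names, the well counts, and the `min_fold` cutoff are hypothetical and are not specified in the assay description, which states only that an increased signal over background indicates phosphorylation.

```python
from statistics import mean


def fold_over_background(sample_counts, background_counts):
    """Return the mean sample signal divided by the mean background signal."""
    background = mean(background_counts)
    if background <= 0:
        raise ValueError("background signal must be positive")
    return mean(sample_counts) / background


def indicates_phosphorylation(sample_counts, background_counts, min_fold=1.0):
    # min_fold is a hypothetical cutoff; the text only requires a signal
    # that is increased relative to background.
    return fold_over_background(sample_counts, background_counts) > min_fold


# Made-up time-resolved fluorescence counts, for illustration only.
egf_wells = [5200, 4800, 5000]
blank_wells = [1000, 1100, 900]
print(fold_over_background(egf_wells, blank_wells))
print(indicates_phosphorylation(egf_wells, blank_wells))
```

In practice the cutoff would be chosen from replicate background wells rather than fixed in advance.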



Example 21: Method of Determining Alterations in a Gene Corresponding to a
Polynucleotide

RNA is isolated from entire families or individual patients presenting with a
phenotype of interest (such as a disease). cDNA is then generated from
these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA
is then used as a template for PCR, employing primers surrounding regions of interest
in SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer
solutions described in Sidransky, D., et al., Science 252:706 (1991).
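
As a rough planning aid, the hold times in the suggested cycling conditions can be summed to bound the run time. The sketch below is an assumption-laden illustration: it counts only the programmed holds, ignores ramp time between temperatures, and uses hypothetical names throughout.

```python
CYCLES = 35
DENATURE_S = 30          # hold at 95°C
ANNEAL_S = (60, 120)     # hold at 52-58°C, shortest and longest
EXTEND_S = (60, 120)     # hold at 70°C, shortest and longest


def cycling_time_minutes(cycles, denature_s, anneal_s, extend_s):
    """Total programmed hold time in minutes, excluding temperature ramps."""
    return cycles * (denature_s + anneal_s + extend_s) / 60


shortest = cycling_time_minutes(CYCLES, DENATURE_S, ANNEAL_S[0], EXTEND_S[0])
longest = cycling_time_minutes(CYCLES, DENATURE_S, ANNEAL_S[1], EXTEND_S[1])
print(f"{shortest:.1f} to {longest:.1f} minutes")  # 87.5 to 157.5 minutes
```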

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre
Technologies). The intron-exon borders of selected exons are also determined and
genomic PCR products analyzed to confirm the results. PCR products harboring
suspected mutations are then cloned and sequenced to validate the results of the
direct sequencing.

PCR products are cloned into T-tailed vectors as described in Holton, T.A. and
Graham, M.W., Nucleic Acids Research, 19:1156 (1991) and sequenced with T7
polymerase (United States Biochemical). Affected individuals are identified by
mutations not present in unaffected individuals.

Genomic rearrangements are also observed as a method of determining
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated
according to Example 2 are nick-translated with digoxigenin-deoxyuridine
5'-triphosphate (Boehringer Mannheim), and FISH performed as described in Johnson,
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe
is carried out using a vast excess of human cot-1 DNA for specific hybridization to
the corresponding genomic locus.

Chromosomes are counterstained with 4,6-diamidino-2-phenylindole and
propidium iodide, producing a combination of C- and R-bands. Aligned images for
precise mapping are obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled charge-coupled device camera
(Photometrics, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv.
et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of
the genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated
disease.

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a
Biological Sample

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected, this polypeptide 
is a marker for a particular phenotype. Methods of detection are numerous, and thus, 
it is understood that one skilled in the art can modify the following assay to fit their 
particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microtiter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 µg/ml. The antibodies are
either monoclonal or polyclonal and are produced by the method described in 
Example 10. The wells are blocked so that non-specific binding of the polypeptide to 
the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample
containing the polypeptide. Preferably, serial dilutions of the sample should be used
to validate results. The plates are then washed three times with deionized or distilled
water to remove unbound polypeptide.

Next, 50 µl of specific antibody-alkaline phosphatase conjugate, at a
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature.
The plates are again washed three times with deionized or distilled water to remove
unbound conjugate.

Add 75 µl of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution to each well and incubate 1 hour at room
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard
curve, using serial dilutions of a control sample, and plot polypeptide concentration
on the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear scale).
Interpolate the concentration of the polypeptide in the sample using the standard
curve.
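
The interpolation step can be sketched as piecewise-linear interpolation of signal against log10 concentration, which matches the log X-axis and linear Y-axis described above. This is a minimal sketch under stated assumptions: the function name, the example standards, and the choice of piecewise-linear (rather than, say, a fitted sigmoidal curve) are illustrative and are not taken from the text.

```python
import math


def interpolate_concentration(signal, standards):
    """Estimate concentration from a plate-reader signal.

    standards: list of (concentration, signal) pairs from the serial
    dilutions of the control sample. Interpolation is linear in signal
    versus log10(concentration).
    """
    points = sorted((math.log10(conc), sig) for conc, sig in standards)
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        low, high = min(y0, y1), max(y0, y1)
        if y0 != y1 and low <= signal <= high:
            fraction = (signal - y0) / (y1 - y0)
            return 10 ** (x0 + fraction * (x1 - x0))
    raise ValueError("signal lies outside the range of the standard curve")


# Made-up standards: (concentration in ng/ml, absorbance).
standards = [(1.0, 0.10), (10.0, 0.50), (100.0, 0.90)]
print(interpolate_concentration(0.30, standards))
```

A signal outside the range of the standards raises an error rather than extrapolating, which is why the serial dilutions of the sample are useful: at least one dilution should fall within the curve.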

Example 23: Formulating a Polypeptide

The secreted polypeptide composition will be formulated and dosed in a
fashion consistent with good medical practice, taking into account the clinical
condition of the individual patient (especially the side effects of treatment with the
secreted polypeptide alone), the site of delivery, the method of administration, the
scheduling of administration, and other factors known to practitioners. The "effective
amount" for purposes herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of
secreted polypeptide administered parenterally per dose will be in the range of about 1
µg/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will
be subject to therapeutic discretion. More preferably, this dose is at least 0.01
mg/kg/day, and most preferably for humans between about 0.01 and 1 mg/kg/day for
the hormone. If given continuously, the secreted polypeptide is typically
administered at a dose rate of about 1 µg/kg/hour to about 50 µg/kg/hour, either by
1-4 injections per day or by continuous subcutaneous infusions, for example, using a
mini-pump. An intravenous bag solution may also be employed. The length of
treatment needed to observe changes and the interval following treatment for
responses to occur appear to vary depending on the desired effect.
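
To make the weight-based ranges concrete, the sketch below scales the stated per-kilogram bounds (about 1 µg/kg/day to 10 mg/kg/day, and about 1 to 50 µg/kg/hour for continuous administration) to a given body weight. The function names and the 70 kg example weight are hypothetical, and the output is arithmetic only; it is not a dosing recommendation.

```python
UG_PER_MG = 1000.0


def daily_dose_range_mg(weight_kg, low_ug_per_kg=1.0, high_mg_per_kg=10.0):
    """Return (low, high) total daily dose in mg for the stated range."""
    low_mg = weight_kg * low_ug_per_kg / UG_PER_MG
    high_mg = weight_kg * high_mg_per_kg
    return low_mg, high_mg


def infusion_rate_range_ug_per_hour(weight_kg, low_ug_per_kg=1.0, high_ug_per_kg=50.0):
    """Return (low, high) continuous dose rate in µg/hour."""
    return weight_kg * low_ug_per_kg, weight_kg * high_ug_per_kg


# Hypothetical 70 kg patient.
low_mg, high_mg = daily_dose_range_mg(70)
print(f"{low_mg:.2f} mg/day to {high_mg:.0f} mg/day")
low_rate, high_rate = infusion_rate_range_ug_per_hour(70)
print(f"{low_rate:.0f} to {high_rate:.0f} µg/hour")
```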

Pharmaceutical compositions containing the secreted protein of the invention
are administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal
patch), buccally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier"
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or
formulation auxiliary of any type. The term "parenteral" as used herein refers to
modes of administration which include intravenous, intramuscular, intraperitoneal,
intrasternal, subcutaneous and intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include
semi-permeable polymer matrices in the form of shaped articles, e.g., films, or
microcapsules. Sustained-release matrices include polylactides (U.S. Pat. No.
3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate
(Sidman, U. et al., Biopolymers 22:547-556 (1983)), poly(2-hydroxyethyl
methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15:167-277 (1981), and R.
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or
poly-D-(-)-3-hydroxybutyric acid (EP 133,988). Sustained-release compositions
also include liposomally entrapped polypeptides. Liposomes containing the secreted
polypeptide are prepared by methods known per se: DE 3,218,121; Epstein et al.,
Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci.
USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP
142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and
EP 102,324. Ordinarily, the liposomes are of the small (about 200-800 Angstroms)
unilamellar type in which the lipid content is greater than about 30 mol. percent
cholesterol, the selected proportion being adjusted for the optimal secreted
polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and
concentrations employed and is compatible with other ingredients of the formulation.
For example, the formulation preferably does not include oxidizing agents and other
compounds that are known to be deleterious to polypeptides.

Generally, the formulations are prepared by contacting the polypeptide
uniformly and intimately with liquid carriers or finely divided solid carriers or both.
Then, if necessary, the product is shaped into the desired formulation. Preferably the
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood
of the recipient. Examples of such carrier vehicles include water, saline, Ringer's
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl
oleate are also useful herein, as well as liposomes.

The carrier suitably contains minor amounts of additives such as substances
that enhance isotonicity and chemical stability. Such materials are non-toxic to
recipients at the dosages and concentrations employed, and include buffers such as
phosphate, citrate, succinate, acetic acid, and other organic acids or their salts;
antioxidants such as ascorbic acid; low molecular weight (less than about ten
residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or
arginine; monosaccharides, disaccharides, and other carbohydrates including cellulose
or its derivatives, glucose, mannose, or dextrins; chelating agents such as EDTA; sugar
alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic
surfactants such as polysorbates, poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of
about 3 to 8. It will be understood that the use of certain of the foregoing excipients,
carriers, or stabilizers will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile.
Sterility is readily accomplished by filtration through sterile filtration membranes
(e.g., 0.2 micron membranes). Therapeutic polypeptide compositions generally are
placed into a container having a sterile access port, for example, an intravenous
solution bag or vial having a stopper pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or multi-dose containers, for
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml
vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution,
and the resulting mixture is lyophilized. The infusion solution is prepared by
reconstituting the lyophilized polypeptide using bacteriostatic Water-for-Injection.
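
The lyophilized-vial example implies a fixed polypeptide mass per vial, since % (w/v) means grams of solute per 100 ml of solution. The short sketch below works through that arithmetic; the function names and the 10 ml reconstitution volume are hypothetical, as the text does not state a reconstitution volume.

```python
def polypeptide_mg_per_vial(fill_volume_ml, percent_w_v):
    """Mass of polypeptide in one vial, in mg, from fill volume and % (w/v)."""
    grams = fill_volume_ml * percent_w_v / 100.0  # % (w/v) = g per 100 ml
    return grams * 1000.0


def concentration_after_reconstitution(mass_mg, reconstitution_volume_ml):
    """Concentration in mg/ml after adding the stated volume of diluent."""
    return mass_mg / reconstitution_volume_ml


mass_mg = polypeptide_mg_per_vial(fill_volume_ml=5, percent_w_v=1)
print(f"{mass_mg:.0f} mg per vial")
# Hypothetical reconstitution volume of 10 ml, not stated in the text.
print(f"{concentration_after_reconstitution(mass_mg, 10):.1f} mg/ml")
```

Under that assumed volume the reconstituted solution falls inside the 1-10 mg/ml range given above as preferred.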

The invention also provides a pharmaceutical pack or kit comprising one or
more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Associated with such container(s) can be a notice in
the form prescribed by a governmental agency regulating the manufacture, use or sale
of pharmaceuticals or biological products, which notice reflects approval by the
agency of manufacture, use or sale for human administration. In addition, the
polypeptides of the present invention may be employed in conjunction with other
therapeutic compounds.

Example 24: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or
normal expression level of a secreted protein in an individual can be treated by
administering the polypeptide of the present invention, preferably in the secreted
form. Thus, the invention also provides a method of treatment of an individual in
need of an increased level of the polypeptide comprising administering to such an
individual a pharmaceutical composition comprising an amount of the polypeptide to
increase the activity level of the polypeptide in such an individual.

For example, a patient with decreased levels of a polypeptide receives a daily
dose of 0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably, the
polypeptide is in the secreted form. The exact details of the dosing scheme, based on
administration and formulation, are provided in Example 23.

Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the
present invention. This technology is one example of a method of decreasing levels
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as
cancer.

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5,
2.0 and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest
period if the treatment was well tolerated. The formulation of the antisense
polynucleotide is provided in Example 23.

Example 26: Method of Treatment Using Gene Therapy

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of
a tissue culture flask; approximately ten pieces are placed in each flask. The flask is
turned upside down, closed tight and left at room temperature overnight. After 24
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed
to the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FBS,
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for
approximately one week.

At this time, fresh media is added and subsequently changed every several
days. After an additional two weeks in culture, a monolayer of fibroblasts emerges.
The monolayer is trypsinized and scaled into larger flasks.

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and
HindIII and subsequently treated with calf intestinal phosphatase. The linear vector is
fractionated on agarose gel and purified, using glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified
using PCR primers which correspond to the 5' and 3' end sequences respectively as set
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear
backbone and the amplified EcoRI and HindIII fragment are added together, in the
presence of T4 DNA ligase. The resulting mixture is maintained under conditions
appropriate for ligation of the two fragments. The ligation mixture is then used to
transform bacteria HB101, which are then plated onto agar containing kanamycin for
the purpose of confirming that the vector has the gene of interest properly inserted.

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing the
gene is then added to the media and the packaging cells transduced with the vector.
The packaging cells now produce infectious viral particles containing the gene (the
packaging cells are now referred to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the
media is harvested from a 10 cm plate of confluent producer cells. The spent media,
containing the infectious viral particles, is filtered through a millipore filter to remove
detached producer cells and this media is then used to infect fibroblast cells. Media is
removed from a sub-confluent plate of fibroblasts and quickly replaced with the
media from the producer cells. This media is removed and replaced with fresh media.
If the titer of virus is high, then virtually all fibroblasts will be infected and no
selection is required. If the titer is very low, then it is necessary to use a retroviral
vector that has a selectable marker, such as neo or his. Once the fibroblasts have been
efficiently infected, the fibroblasts are analyzed to determine whether protein is
produced.

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 



Example 27: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods
to treat disorders, diseases and conditions. The gene therapy method relates to the
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA)
sequences into an animal to increase or decrease the expression of the polypeptide.
The polynucleotide of the present invention may be operatively linked to a promoter
or any other genetic elements necessary for the expression of the polypeptide by the
target tissue. Such gene therapy and delivery techniques and methods are known in
the art, see, for example, WO90/11092, WO98/11779; U.S. Patent Nos. 5693622,
5705151, 5580859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479, Chao J. et
al. (1997) Pharmacol. Res. 35(6):517-522, Wolff J.A. (1997) Neuromuscul. Disord.
7(5):314-318, Schwartz B. et al. (1996) Gene Ther. 3(5):405-411, Tsurumi Y. et al.
(1996) Circulation 94(12):3281-3290 (incorporated herein by reference).

The polynucleotide constructs may be delivered by any method that delivers
injectable materials to the cells of an animal, such as injection into the interstitial
space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The
polynucleotide constructs can be delivered in a pharmaceutically acceptable liquid or
aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are
free from any delivery vehicle that acts to assist, promote, or facilitate entry into the
cell, including viral sequences, viral particles, liposome formulations, lipofectin or
precipitating agents and the like. However, the polynucleotides of the present
invention may also be delivered in liposome formulations (such as those taught in
Felgner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah B. et al.
(1995) Biol. Cell 85(1):1-7) which can be prepared by methods well known to those
skilled in the art.

The polynucleotide vector constructs used in the gene therapy method are
preferably constructs that will not integrate into the host genome nor will they contain
sequences that allow for replication. Any strong promoter known to those skilled in
the art can be used for driving the expression of DNA. Unlike other gene therapy
techniques, one major advantage of introducing naked nucleic acid sequences into
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies
have shown that non-replicating DNA sequences can be introduced into cells to
provide production of the desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the interstitial space of
tissues within an animal, including that of muscle, skin, brain, lung, liver, spleen, bone
marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder,
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and
connective tissue. Interstitial space of the tissues comprises the intercellular fluid,
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers
in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same
matrix within connective tissue ensheathing muscle cells or in the lacunae of bone. It
is similarly the space occupied by the plasma of the circulation and the lymph fluid of
the lymphatic channels. Delivery to the interstitial space of muscle tissue is preferred
for the reasons discussed below. The constructs may be conveniently delivered by
injection into the tissues comprising these cells. They are preferably delivered to and
expressed in persistent, non-dividing cells which are differentiated, although delivery
and expression may be achieved in non-differentiated or less completely
differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In
vivo muscle cells are particularly competent in their ability to take up and express
polynucleotides.

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg
body weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 
mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as 
the artisan of ordinary skill will appreciate, this dosage will vary according to the 
tissue site of injection. The appropriate and effective dosage of nucleic acid sequence 
can readily be determined by those of ordinary skill in the art and may depend on the 
condition being treated and the route of administration. The preferred route of 
administration is by the parenteral route of injection into the interstitial space of 
tissues. However, other parenteral routes may also be used, such as, inhalation of an 
aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or 
mucous membranes of the nose. In addition, naked polynucleotide constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

The dose response effects of injected polynucleotide in muscle in vivo are
determined as follows. Suitable template DNA for production of mRNA coding for
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA, which may be either circular or 
linear, is either used as naked DNA or complexed with liposomes. The quadriceps 
muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on
the anterior thigh, and the quadriceps muscle is directly visualized. The template
DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over
one minute, approximately 0.5 cm from the distal insertion site of the muscle into the
knee and about 0.2 cm deep. A suture is placed over the injection site for future
localization, and the skin is closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days) muscle extracts are
prepared by excising the entire quadriceps. Every fifth 15 µm cross-section of the
individual quadriceps muscles is histochemically stained for protein expression. A
time course for protein expression may be done in a similar fashion except that
quadriceps from different mice are harvested at different times. Persistence of DNA
in muscle following injection may be determined by Southern blot analysis after
preparing total cellular DNA and HIRT supernatants from injected and control mice.
The results of the above experimentation in mice can be used to extrapolate proper
dosages and other treatment parameters in humans and other animals using naked
DNA.
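
The sampling scheme of staining every fifth 15 µm cross-section can be sketched as follows. The function name, the 300 µm example length, and the convention that the fifth section (rather than the first) is the first one stained are assumptions made for illustration; the text fixes only the section thickness and the one-in-five sampling.

```python
def stained_section_positions_um(tissue_length_um, thickness_um=15, every_nth=5):
    """Return the starting depth (µm) of each section selected for staining.

    Sections are cut consecutively at the given thickness, and every nth
    section is stained, starting with the nth section.
    """
    n_sections = tissue_length_um // thickness_um
    return [i * thickness_um for i in range(every_nth - 1, n_sections, every_nth)]


# Hypothetical 300 µm stretch of muscle.
positions = stained_section_positions_um(300)
print(positions)  # [60, 135, 210, 285]
```

With these parameters consecutive stained sections are 75 µm apart, so expression is sampled at regular intervals along the muscle rather than in every section.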

Example 28: Transgenic Animals.

The polypeptides of the invention can also be expressed in transgenic animals.
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters,
guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g.,
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a
specific embodiment, techniques described herein or otherwise known in the art, are
used to express polypeptides of the invention in humans, as part of a gene therapy
protocol.

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of
transgenic animals. Such techniques include, but are not limited to, pronuclear
microinjection (Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994);
Carver et al., Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology
(NY) 9:830-834 (1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus
mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci.,
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic
stem cells (Thompson et al., Cell 56:313-321 (1989)); electroporation of cells or
embryos (Lo, Mol. Cell. Biol. 3:1803-1814 (1983)); introduction of the
polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science
259:1745 (1993)); introducing nucleic acid constructs into embryonic pluripotent
stem cells and transferring the stem cells back into the blastocyst; and
sperm-mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989)); etc. For a
review of such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol.
115:171-229 (1989), which is incorporated by reference herein in its entirety.

Any technique known in the art may be used to produce transgenic clones
containing polynucleotides of the invention, for example, nuclear transfer into
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to
quiescence (Campbell et al., Nature 380:64-66 (1996); Wilmut et al., Nature
385:810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene
in all their cells, as well as animals which carry the transgene in some, but not all their
cells, i.e., mosaic or chimeric animals. The transgene may be integrated as a single
transgene or as multiple copies such as in concatamers, e.g., head-to-head tandems or
head-to-tail tandems. The transgene may also be selectively introduced into and
activated in a particular cell type by following, for example, the teaching of Lasko et
al. (Lasko et al., Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory
sequences required for such a cell-type specific activation will depend upon the
particular cell type of interest, and will be apparent to those of skill in the art. When
it is desired that the polynucleotide transgene be integrated into the chromosomal site
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is
to be utilized, vectors containing some nucleotide sequences homologous to the
endogenous gene are designed for the purpose of integrating, via homologous
recombination with chromosomal sequences, into and disrupting the function of the
nucleotide sequence of the endogenous gene. The transgene may also be selectively
introduced into a particular cell type, thus inactivating the endogenous gene in only
that cell type, by following, for example, the teaching of Gu et al. (Gu et al., Science
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific
inactivation will depend upon the particular cell type of interest, and will be apparent
to those of skill in the art.

Once transgenic animals have been generated, the expression of the
recombinant gene may be assayed utilizing standard techniques. Initial screening
may be accomplished by Southern blot analysis or PCR techniques to analyze animal
tissues to verify that integration of the transgene has taken place. The level of mRNA
expression of the transgene in the tissues of the transgenic animals may also be
assessed using techniques which include, but are not limited to, Northern blot analysis
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse
transcriptase-PCR (rt-PCR). Samples of transgenic gene-expressing tissue may also
be evaluated immunocytochemically or immunohistochemically using antibodies
specific for the transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more
than one integration site in order to establish separate lines; inbreeding of separate
lines in order to produce compound transgenics that express the transgene at higher
levels because of the effects of additive expression of each transgene; crossing of
heterozygous transgenic animals to produce animals homozygous for a given
integration site in order to both augment expression and eliminate the need for
screening of animals by DNA analysis; crossing of separate homozygous lines to
produce compound heterozygous or homozygous lines; and breeding to place the
transgene on a distinct background that is appropriate for an experimental model of
interest.

Transgenic animals of the invention have uses which include, but are not
limited to, animal model systems useful in elaborating the biological function of
polypeptides of the present invention, studying conditions and/or disorders associated
with aberrant expression, and in screening for compounds effective in ameliorating
such conditions and/or disorders.

Example 29: Knock-Out Animals.

Endogenous gene expression can also be reduced by inactivating or "knocking
out" the gene and/or its promoter using targeted homologous recombination. (E.g.,
see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell
51:503-512 (1987); Thompson et al., Cell 56:313-321 (1989); each of which is
incorporated by reference herein in its entirety). For example, a mutant, non-functional
polynucleotide of the invention (or a completely unrelated DNA sequence) flanked by
DNA homologous to the endogenous polynucleotide sequence (either the coding
regions or regulatory regions of the gene) can be used, with or without a selectable
marker and/or a negative selectable marker, to transfect cells that express
polypeptides of the invention in vivo. In another embodiment, techniques known in
the art are used to generate knockouts in cells that contain, but do not express the gene
of interest. Insertion of the DNA construct, via targeted homologous recombination,
results in inactivation of the targeted gene. Such approaches are particularly suited in
research and agricultural fields where modifications to embryonic stem cells can be
used to generate animal offspring with an inactive targeted gene (e.g., see Thomas &
Capecchi 1987 and Thompson 1989, supra). However, this approach can be routinely
adapted for use in humans provided the recombinant DNA constructs are directly
administered or targeted to the required site in vivo using appropriate viral vectors that
will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered
to express the polypeptides of the invention, or alternatively, that are genetically
engineered not to express the polypeptides of the invention (e.g., knockouts) are
administered to a patient in vivo. Such cells may be obtained from the patient (i.e.,
animal, including human) or an MHC compatible donor and can include, but are not
limited to, fibroblasts, bone marrow cells, blood cells (e.g., lymphocytes), adipocytes,
muscle cells, endothelial cells, etc. The cells are genetically engineered in vitro using
recombinant DNA techniques to introduce the coding sequence of polypeptides of the
invention into the cells, or alternatively, to disrupt the coding sequence and/or
endogenous regulatory sequence associated with the polypeptides of the invention,
e.g., by transduction (using viral vectors, and preferably vectors that integrate the
transgene into the cell genome) or transfection procedures, including, but not limited
to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc.
The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The
engineered cells which express and preferably secrete the polypeptides of the
invention can be introduced into the patient systemically, e.g., in the circulation, or
intraperitoneally.
Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. (See, for example, Anderson et al., U.S. Patent No. 5,399,349; and
Mulligan & Wilson, U.S. Patent No. 5,460,959, each of which is incorporated by
reference herein in its entirety).

When the cells to be administered are non-autologous or non-MHC
compatible cells, they can be administered using well-known techniques which
prevent the development of a host immune response against the introduced cells. For
example, the cells may be introduced in an encapsulated form which, while allowing
for an exchange of components with the immediate extracellular environment, does
not allow the introduced cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include, 
but are not limited to, animal model systems useful in elaborating the biological
function of polypeptides of the present invention, studying conditions and/or disorders
associated with aberrant expression, and in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. Numerous 
modifications and variations of the present invention are possible in light of the above 
teachings and, therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples 
is hereby incorporated herein by reference. Further, the hard copy of the sequence 
listing submitted herewith and the corresponding computer readable form are both 
incorporated herein by reference in their entireties. 
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 180, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [ ]

Name of depositary institution American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit

February 12, 1998

Accession Number

209628

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 183, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet

Name of depositary institution American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit

February 25, 1998

Accession Number

209641

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 185, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [ ]

Name of depositary institution American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit

March 4, 1998

Accession Number

209651

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)
What Is Claimed Is:

1. An isolated nucleic acid molecule comprising a polynucleotide having
a nucleotide sequence at least 95% identical to a sequence selected from the group
consisting of:

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment
of the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to
SEQ ID NO:X;

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit
No:Z, which is hybridizable to SEQ ID NO:X;

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit
No:Z, which is hybridizable to SEQ ID NO:X;

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit
No:Z, which is hybridizable to SEQ ID NO:X;

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X,
having biological activity;

(f) a polynucleotide which is a variant of SEQ ID NO:X;

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X;

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y;

(i) a polynucleotide capable of hybridizing under stringent conditions to any
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide
sequence of only A residues or of only T residues.

2. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding a secreted 
protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence
included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X 
or the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X. 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1. 

9. A recombinant host cell produced by the method of claim 8. 



10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least
95% identical to a sequence selected from the group consisting of:

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence
included in ATCC Deposit No:Z;

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence
included in ATCC Deposit No:Z, having biological activity;

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z;

(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z;

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(f) a full length protein of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(g) a variant of SEQ ID NO:Y;

(h) an allelic variant of SEQ ID NO:Y; or

(i) a species homologue of the SEQ ID NO:Y.

12. The isolated polypeptide of claim 11, wherein the secreted form or the
full length protein comprises sequential amino acid deletions from either the
C-terminus or the N-terminus.

13. An isolated antibody that binds specifically to the isolated polypeptide 
of claim 11. 






14. A recombinant host cell that expresses the isolated polypeptide of 
claim 11. 






15. A method of making an isolated polypeptide comprising: 
(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 
(b) recovering said polypeptide.

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount 
of the polypeptide of claim 11 or the polynucleotide of claim 1.

18. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 11 in a biological sample; and

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim
11 comprising:

(a) contacting the polypeptide of claim 11 with a binding partner; and

(b) determining whether the binding partner effects an activity of the 
polypeptide. 



21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y.

22. A method of identifying an activity in a biological assay, wherein the
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 

<110> Human Genome Sciences, Inc., et al.

<120> 95 Human Secreted Proteins 

<130> PZ027PCT 

<140> Unassigned 
<141> 1999, March 18 



<150> 60/078,566 
<151> 1998-03-19 

<150> 60/078,574 
<151> 1998-03-19 

<150> 60/078,576 
<151> 1998-03-19 

<150> 60/078,563 
<151> 1998-03-19 

<150> 60/078,573 
<151> 1998-03-19 

<150> 60/078,578 
<151> 1998-03-19 

<150> 60/078,579 
<151> 1998-03-19 

<150> 60/078,581 
<151> 1998-03-19 

<150> 60/078,577 
<151> 1998-03-19 

<150> 60/080,314 
<151> 1998-04-01 

<150> 60/080,312 
<151> 1998-04-01 

<150> 60/080,313 
<151> 1998-04-01 

<160> 392 

<170> PatentIn Ver. 2.0

<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 1 
gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 

acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

gactctagag gat 733 



<210> 2 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> Site 
<222> (3) 

<223> Xaa equals any of the twenty naturally occurring L-amino acids
<400> 2 

Trp Ser Xaa Trp Ser 
1 5 

<210> 3 

<211> 86 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 
cccgaaatat ctgccatctc aattag 



<210> 4 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 



<210> 5 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60

aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120

gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 

ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 6 
<211> 32 
<212> DNA 

<213> Homo sapiens 
<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 32 

<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 



<210> 8 
<211> 12 
<212> DNA 

<213> Homo sapiens 
<400> 8 

ggggactttc cc 12 



<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 73 
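
The layout used throughout this listing (residues in groups of ten, with a running residue count at the end of each line) can be checked mechanically against the length declared in the `<211>` field. The following is a minimal sketch, not part of any official sequence-listing tool; the function name `check_sequence_block` is hypothetical, and the example lines are copied from SEQ ID NO:8 and SEQ ID NO:9 above.

```python
def check_sequence_block(lines, declared_length):
    """Verify running counts and total length for one sequence block.

    `lines` are sequence lines as printed in the listing: space-separated
    residue groups, optionally followed by a running residue count.
    Returns the total residue count, or raises ValueError on a mismatch.
    """
    total = 0
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank spacer lines
        # A trailing all-digit token is the running count for this line.
        running = int(tokens.pop()) if tokens[-1].isdigit() else None
        total += sum(len(group) for group in tokens)
        if running is not None and running != total:
            raise ValueError(f"running count {running} != residues so far {total}")
    if total != declared_length:
        raise ValueError(f"declared length {declared_length} != counted {total}")
    return total


# Lines as printed for SEQ ID NO:8 (<211> 12) and SEQ ID NO:9 (<211> 73).
seq_8 = ["ggggactttc cc 12"]
seq_9 = [
    "gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60",
    "ccatctcaat tag 73",
]
len_8 = check_sequence_block(seq_8, 12)
len_9 = check_sequence_block(seq_9, 73)
print(len_8, len_9)
```

A check of this kind flags lines whose running count has been split or displaced, which is the most common damage in a printed listing.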

<210> 10 
<211> 256 
<212> DNA 
<213> Homo sapiens 

<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240

cttttgcaaa aagctt 256

<210> 11

<211> 2343 

<212> DNA 

<213> Homo sapiens 

<400> 11 

acgcgtccgg tttttcaaag gtttaactgt ccagggcaga tacttaagac tatctgatca 60 

tccattaaaa acttttcaca tagtcttgct taaatggatc cattatgttt acccattata 120 

ttgttctcag ctgtagtttt aagaaattta tttcatttgt taatacttac tttccattac 180 

cttccccttt tctgtgacaa tccgttgata cttgaagacc tctcttgtat tcatcttagg 240 

gttaatattt ttaaggccaa acagcctaaa ttctatggta atcaactcca gccttgtgta 300 

atgaaatctt ctgcataaag ataggtttaa atcaaatcag attgcagatt ttattgaaga 360 

aattgtgttt ttaagagttg acaaatatat gttgtatggc taaaacaaag aaaatacttc 420 

tgttgcttct gcatttagta gaagaaaaac tatatatgtt tgtgaccaaa gtataaaata 480 

tgattctttc cagggaggta aaggttatgc acaagatttt cactagcagc tctaaaaggc 540 

taccctcaat taattgccat gaacatttca tagccctaga aggatgtagg ctcatttcag 600 

tgtcatcctg gtttattctt tattgtatta ttcagcagtc attttaacac tatgctagac 660 

actttagaga ttcagaagag taacagggtt tctgttctca tgaagcttat caggagacag 720 

aaaacatatg aattagatct aattggaggc aaactgaaat atatagtgga gttagtgtgg 780 

ttatcagcac ataaatgagt gatccatcaa caaaaggaga aattgggagg gttttatggg 840 

ccaaaaacag catgattaaa tgtgatagag tatatgtcat gttttaggtg tgatgaacat 900 

tcagttatgt gtgacgaata ggataattga aaaaatatga aaggctatga tgccagaaag 960 

tattatggga caagatctta aaaccagtgt tacctaggga gtatgaattt aatatgggaa 1020 

ttcttaaact cctttatgac tggaagatga gcatcagagt gtctgcgacc attttgatga 1080 

tatgatgtac cagtttttaa atgtttggct ttttccaggt gatgaaagcg ggggatgagt 1140 

taagaaccac tgctgtgaag gattcacaac tatttttagg cagttgggta aaaatgacca 1200 

atttagtttt aagaaactga ctgtggctcc agagtatgtt ggagaagtga aaatggagac 1260

taggaataac aggtgggaga ctattagtct aattaagatg taattataaa tctaagctag 1320 

gaacgtaaaa tgagaatgca aagtaagaaa caaatatggg gaaaattata tgtaaaagta 1380 

ataggacttg gcatcttact gatgtgattg attatgagaa aaatgaagca tgtggaggag 1440 

tccactggac agtaggaaat tcagcctaag acttgggtaa gagttctgtg gagttgtgaa 1500 

ttcagaggcc agagatgtga tatttaaaat tttggttcaa gatttcccag gtataagaaa 1560 

gcaagaggat taaagcattg taattaaact ttaagcagtg catatttatg ttatagataa 1620 

gataaacaag aaatctaggg atcaaatagg attaaaatta gtagtgatca ttcagtacag 1680 

tagttacgta ctgttattca caagagtata taaatcaaat tacaaggaat taaggatata 1740 

aacgtgataa gaaagtatgc actgtactct ttgaggaagt ttgccataga aaggaagaag 1800 

aaataggatg gtagatcaga agtaaagcag gacccagtgg ggggagtgtt tgcagtgagg 1860 

cagtatgtat aatcatttaa aacatgggtt tggagtcctc tcaggttcca tgtttgtaat 1920 

ggacataatg ataataatcc ctttcattta aggctgttgt gaggattaaa tgtgttaatg 1980 

tgcaaataac tttacacagt gcctggtata taataaatgc ttgctaccta ttaactagta 2040 

tttgtttcta aggctaattt aagtcctaga attgattgca aggattagat caggagtata 2100 

gtggacatgt tgggatttaa atatttaaat atagagatgc tttttaggac cattgttaga 2160 

accagaagag attttttacc aagttcacac agaaatgtag gtgcattggc tgggcatggt 2220 

ggctcacacc tgcagtccca gcacttggga aggctgaggc agaagaactg cttgaggcca 2280 

acattttgag accagcctgg gcaacatatt aagaccccgt ctccaccaaa aaaaaaaaaa 2340

aaa 2343

<210> 12 

<211> 1177 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> SITE
<222> (1095) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1115) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1142) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (1162) 

<223> n equals a,t,g, or c 



<400> 12 

agccaccatg cccggcctag attaaaaatt tgaagacata ttctctacta tgagccaatg 60 

aaattactca ttttgtttct atcccatttg ctgtcccttg cttttggaat tttgtgtctt 120 

agtgtgactg tgattctttc tctccttttg tctttcagca aacggggatt cagcgtccga 180 

tcctttggaa cagggactca cgtgaagctt ccaggaccag ctcccgacaa gcccaatgtt 240 

tatgatttca aaaccacata tgaccagatg tacaatgatc ttcttaggaa agacaaagaa 300 

ctctatacac agaatgggat tttacatatg ctggacagaa ataagagaat caagccccgg 360 

ccagaaagat tccagaactg caaagacctg tttgatctga tcctcacttg cgaagagaga 420

gtgtatgacc aggtggtgga agatctgaat tccagagaac aggagacctg ccagccygtg 480 

cacgtggtca atgtggacat ccaggacaac cacgaggagg ccaccctggg ggcgtttctc 540 

atctgtgagc tctgccagtg tatccagcac acggaagaca tggagaacga gatcgacgag 600 

ctgctgcagg agttcgagga gaagagtggc cgcacctttc tgcacaccgt ctgcttctac 660 

tgagcccagc gcccgcatgg agccgcctct ggagcttcct gttgttcata ctttttcctt 720 

cctgacattt gtttttactt acaggtgttc tgctggtgac ggtagcatta cccaaataaa 780 

ctgtgcatat gaaatgggag aggagatgcc aaaacgccag atgaaagcaa tcaagtttct 840 

tcttttccac ttttacttat gagcrggata ttgattacaa agtttttctt ctttaaccaa 900 

aaaggaaaga caacggtttg tgtgcacttc ccgacatacc tgtgtcttcg tgtgcctgcc 960 

ttccctccct cctccccacc gggccggact gtacagagcc ctgctgcggc gtgttaggaa 1020 

tgacctggaa ttgtcaataa acagatgctg ctgtcaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

aaaaaaaaaa raaancaaaa aaaaaaaaaa aaggnggggc cgaaggtttt ttccctttgg 1140 

tnggggttat ttttggcttg gnattggcct tcgtttt 1177 



<210> 13 

<211> 2107 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (149) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 
<223> n equals a,t,g, or c



<400> 13 

tttaggtatg catataaaag aaaacaaaat atttaaaaca cttaaaggag atctgacgaa 60 

acctaaagag aagaaaaata ataaaattaa gtaaagaaaw ggtatggcag gattttatgt 120 

ttgcctgtgc cctttatcca actgatcang gcttcctgga wtcagtgaca gcagaagttg 180 

cctaccagat caagagactg aaatctcatc cttctatcat catatggagt ggcaataatg 240 

aaaatgagga ggcgctgatg atgaattggt atcatatcag tttcactgac cggccaatct 300 

acatcaagga ctatgtgaca ctctatgtga aaaacatcag agagctcgta ctggcaggag 360 

acaagagtcg tccttttatt acgtccagtc ctacaaatgg ggctgaaact gttgcagaag 420 

cctgggtctc tcaaaaccct aatagcaatt attttggtga tgtacatttt tatgactata 480 

tcagtgnatt gctggaactg gaaagttttc ccaaaagctc gatttgcatc tgaatatgga 540 

tatcagtcct ggccgtcctt cagtacatta gaaaaggtct cgtctacaga ggactggtct 600 

ttcaatagca agttttcact tcatcgacaa catcacgaag gtggtaacaa acaaatgctt 660 

tatcaggctg gacttcattt caaactcccc caaagcacag atccattacg cacatttaaa 720 

gataccatct accttactca ggtgatgcag gcccagtgtg tcaaaacaga aactgaattc 780 

taccgccgta gtcgcagcga gatagtggat cagcaagggc acacgatggg ggcactttat 840 

tggcagttga atgacatctg gcaagctcct tcctgggctt ctcttgagta cggaggaaag 900 

tggaaaatgc ttcattactt tgctcagaat ttctttgctc cactgttgcc agtaggcttt 960 

gagaatgaaa acaygttcta tatctatggt gtgtcagatc ttcactcgga ttattcgatg 1020 

acactcagtg tgagagtcca tacatggagc tccctggagc ccgtgtgctc tcgtgtgact 1080 

gaacgttttg tgatgaaagg aggagaggct gtctgccttt atgaggagcc agtgtctgaa 1140 

ttgctgagga gatgtgggaa ttgcacacgg gaaagctgtg tggtttcctt ttacctttca 1200 

gctgaccatg aactcctgag cccgaccaac taccacttcy tgtcctcacc gaaggaggcc 1260

gtggggctct gcaaggcgca gatcactgcc atcatctctc agcaaggtga catatttgtt 1320 

tttgacctgg agacctcagc tgtcgctccc tttgtttggt tggatgtagg aagcatccca 1380

gggagattta gtgacaatgg tttcctcatg actgagaaga cacgaactat attattttac 1440 

ccttgggagc ccaccagcaa gaatgagttg gagcaatctt ttcatgtgac ctccttaaca 1500 

gatatttact gaaggaatct aggttgtatt ttcagtggac aatgggaata aagcatttct 1560 

aaagcaccga ctggagagga aggcaacaga gacaaggaga gaagccgaga gacatgtctg 1620 

cgtgctgcca cgcatctgag cgattgctct gtgaagagtt gtacactgaa cattttcagg 1680 

ggaggctgtt tacccaggca atgtcctcaa acaagcctgt gccggggtgt cctggaatct 1740 

gtgccaggac tgtgttttta gcccttcacc tctcagcttt agcaggacat gaaccagtta 1800 

taacaagatg sccctgcagc tggttacaag aatgtgacat ggcaggatct atggaaccaa 1860 

atggaaggtt ttgaggtgat gtaggtcttt cacagttagc tttggggaat acagaatact 1920 

caaataaagt gctttgttat tatttcagag ggaatggcga ttgaaatgtt acaacagaga 1980 

tttcttggtg gtagctattt gggtaaaggt atatggatat ttttctgtac atgtgaaatt 2040 

atataaaaat aaaagttata taaattacat tgaaaaaaaa aaaaaaaaaa aaaaaaaggg 2100 

cggccgc 2107 



<210> 14 

<211> 1262 

<212> DNA 

<213> Homo sapiens 

<400> 14 

cctaatggcc cgasctgaat acttgaagga gctcaagatg agggaatctc gctgggaagc 60 

tgacaccctg gacaaagagg gactgtcgga atctgttcgt agctcttgca cccttcagtg 120 

accctagaag aatgattgga cagatgtgag ccatctggag cagaggggca ctaacccagg 180 

ctgacgccaa gaatgaagtg gcccactgca gccctggcga gcaggcttct tggatggaca 240 

gtgctgagac ccccatatcc cagagtcccc agcctccctc aggttactct gcaccccaca 300 

gatggtttga tggctgtgct gtatactgga ggggagggca ggactctggg agaacagcac 360 

ttctttcatg agacctttgt tactcggtgg ttactgggtc ctgtgcctgt ccgttttggg 420 

gcatgcagcc ctctatcatt tttggctccg agaagagggc aaggggcccc cgcaggtarc 480 
ttctgtgctt gccctcgccc tgccagcagg cagctgtgcc cctggcctgc ccttcccggg 540

accccttatt ccaactcagc tcctctttgc actggaatgg ggcactccaa cacccctcag 600 

ggaccaccct ccccacagta tgcactcagc cccacagaac ccaccagtct ttctgggaac 660 

tcacacctgc ccgccatctt ggtactttag gttaatccct caagcatgaa agctggatct 720 

tttggggttt aagaagccca agccttgttc ctgccctggc ctagggagca ctcaggaggg 780 

ttccttggtc ctcatctctc ccacctccgt tccctctggg ccccacacta gccacagcgc 840 

gggccttgtg ctggagtttg agcctgggac agggagaggg aggcttggag acagtctgac 900 

ccagtgccct ctaggccacc cacttctagg cctgccctgc cgccgtggag ccctgggcaa 960 

gctctttccc ctttctgggc ctgggtctcc ccatctcttc aatggggctg ataccttcac 1020 

agcccacagc atgggcactt atgaggacaa agtgaattta acctggaaaa gaatgtattt 1080 

gagagtttct tttaaataat cagcgggtgt tggtgatttg tagcccttct gcccttaaat 1140 

gcttccttgg gcaagagctg tctgtcctcc ctgcaggagg ctgagtgtga agagtatcat 1200 

tcattgtttc tctattaaat tattttctgc taaaaaaaaa aaaaaaaaat ttctgcggtc 1260

c 1262



<210> 15 
<211> 759 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (16) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (22) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (36) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (51) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (52) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
<400> 15

ggattaacaa attttncaca cnaggaaaac aggttnttga cccaattagg nnttttnnca 60 

aaaaagctta tttttaggtt gacacttatt agaagttacg ccttgcaggt taccggttcc 120 

ggaattcccg ggtcgaaccc caaggggttc gcggacccca gacatgagga ggctcctcct 180 

ggtcaccagc ctggtggttg tgctgctgtg ggaggcaggt gcagtcccag cacccaaggt 240 

ccctatcaag atgcaagtca aacactggcc ctcagagcag gacccagaga aggcctgggg 300 

cgcccgtgtg gtggagcctc cggagaagga cgaccagctg gtggtgctgt tccctgtcca 360 

gaagccgaaa ctcttgacca ccgaggagaa gccacgaggt cagggcaggg gccccatcct 420 

tccaggcacc aaggcctgga tggagaccga ggacaccctg ggccgtgtcc tgagtcccga 480 

gcccgaccat gacagcctgt accaccctcc gcctgaggag gaccagggcg aggagaggcc 540 

ccggttgtgg gtgatgccaa atcaccaggt gctcctggga ccggaggaag accaagacca 600 

catctaccac ccccagtagg gctccagggg ccatcactgc ccccgccctg tcccaaggcc 660 

caggctgttg ggactgggac cctccctacc ctgccccagc tagacaaata aaccccagca 720 

ggccgggaaa aaaaaaaaaa aaaaaaaaag ggcggccgc 759 



<210> 16 
<211> 1810 
<212> DNA 
<213> Homo sapiens 

<400> 16 

cacgagggtg tgcgtgctta ggcaggaacc cagttttact ttatgccatg tggaaagttt 60
ctttttccag tatcaccagt gagttcactg tctctccact ggtctgcagt gctgcttctg 120
ttacttgcag acttcccacg tgtgcatgga tctccacctg gggtctctag ggtctctatt 180
ctacactgcc tatttccctt tctgtcctaa caccatagca tttaactcac ccgtcatcct 240
gtgttgctga gaatttcctt catagaactc atcaaagtat gattaactgt gctccctgag 300
ggcaggaatt atgccatctg gatcaccagc ctctcccttg tccttagcac gccatctgca 360
aattagcaga tactcggtaa atgtgtatta actcgaagta tattttgtgt cttctctgtg 420
cacagcactg ccctgggaag aactaggatg aggtattgac ttgctgttgc cacataacaa 480
accctgccag aactccctgg atggaagtga ccaccgtgta tctgtggatt gtctgcaggg 540
ctctgctggg gtcagcaggt cccacaacag agccagggct cggtctcctc atggctgtca 600
gaggtttacg tattccgcct cctcccacca aagtctgaag ttgttgtatt ccattccttg 660
ctatatccac atcttttaat aatgctaaaa tcccgtgttt ctctaaagca ttggattgaa 720
ccaactgaag aaggaccacg tgtgttgctg ggcctgcttg ggcacaagcc gtttccgatc 780
caagtcaact gctggtctgc ttagacgaag gtgtgtgggt gtctccacca cggagaggag 840
ggacagcagg tgagaccata ggccaggaag gaagggcaca gcctaagcgt gcagtggctt 900
agccagagac cctcgtgcac cagccttcca ggtgcttatt ggaacttatg tcagcccagg 960
ccatatccaa gtgtgtgatg tctcggagca tatatgccag gccagccgga gaggcttagc 1020
cctgccctgg tggagctgga gggccgcagg gccgcccggt ggggtcagga ggttgtgaag 1080
aggatcctga tacaggctgg gcctccctgc aggcgtgagc cccggagcac ggggtgagca 1140
gctccaccca gaggggcttg caggaccaag ctgggacagc aaccaccagg ccctggggca 1200
gatcagtgag cgtccaggag atgcagatgc agaagacagc caaattcatt cacctctgcg 1260
tgggcctgtg agggcccaca gagatgcatt ttcattcacg accaggattt cctcggccgg 1320
agcagccgct tttcccagcc gaagctcact gtgtttacta cataggatgt gagtgtatag 1380
aaagactctc tctaacgtta gtacgcgtgc agaaatgtgg ggccgcttac aagtgtgggc 1440
agccgcagcc tgttcctcac ccctgtccta acgggacata ctccacgcat gcacatttag 1500
gatcaccgtg tcttctcgtt ggactgatct gtcattagga ccctggaccc aagtaattgt 1560
ctttgctctg aagttttgac agtaacaaag gcattccagc tctttctttt tcactcctgt 1620
cggtgtaacg tgccgttttt catcctttga cttttagccc gcctgtgccc tgtctgaagg 1680
gagttgtctg tggacagtca cggagtggtg ggtgtttgta atccactctg ccagcctcag 1740
tcttctaact gttgcgtatg gaccaattac atctgccctt tctcttccct gctaaaaaaa 1800
aaaaaaaaaa 1810

<210> 17

<211> 1052 

<212> DNA 

<213> Homo sapiens 



<400> 17 

gcaattttct gcatagcatc agcaatgagt ctgtacaact gtcttgctgc actaattcat 60 

aagataccat atggacaatg cacgattgca tgtcgtggca aaaacatgga agtgagactt 120 

atttttctct ctggactgtg catagcagta gctgttgttt gggctgtgtt tcgaaatgaa 180 

gacaggtggg cttggatttt acaggatatc ttggggattg ctttctgtct gaatttaatt 240 

aaaacactga agttgcccaa cttcaagtca tgtgtgatac ttctaggcct tctcctcctc 300 

tatgatgtat tttttgtttt cataacacca ttcatcacaa agaatggtga gagtatcatg 360 

gttgaactcg cagctggacc ttttggaaat aatgaaaagt tgccagtagt catcagagta 420 

ccaaaactga tctatttctc agtaatgagt gtgtgcctca tgcctgtttc aatattgggt 480 

tttggagaca ttattgtacc aggcctgttg attgcatact gtagaagatt tgatgttcag 540 

actggttctt cttacatata ctatgtttcg tctacagttg cctatgctat tggcatgata 600 

cttacatttg ttgttctggt gctgatgaaa aaggggcaac ctgctctcct ctatttagta 660 

ccttgcacac ttattactgc ctcagttgtt gcctggagac gtaaggaaat gaaaaagttc 720 

tggaaaggta acagctatca gatgatggac catttggatt gtgcaacaaa tgaagaaaac 780 

cctgtgatat ctggtgaaca gattgtccag caataatatt atgtggaact gctataatgt 840 

gtcattgatt ttctacaaat agacttcgac tttttaaatt gacttttgaa ttgacaatct 900 

gaaagagtct tcaatgatat gcttgcaaaa atatattttt atgagctggt actgacagtt 960 

acatcataaa taactaaaac gctttgcttt taatgttaaa gttgtgcctt cacattaaat 1020 

aaaacatatg gtctgtgtar tttcaaaaaa aa 1052 



<210> 18 

<211> 1130 

<212> DNA 

<213> Homo sapiens 



<400> 18 

ggcacgaggc catttgtata attctttagt aaattgtatt aatgggagaa tctgtaagtt 60 

atgtctgaac tttcaggttg tcttataatt gtctttttcc ttatgtcaga tgttctatgt 120 

cataagaata aaatggttca caccaataca agtacttagt tgtggaaagg gagagtagaa 180 

gataaaaatg gagattttcc tgtgctacag gcttagtcaa gcttatggtc tatttaatgg 240 

ttatcaaagg caattaaata gtgttgaatg ttctgctttt acctacattt catttttcat 300

gtacttagtt acaaattgaa ccctcttcta tttttttcct gctcctgttt ctgtttcatt 360 

ttagttttcc ttttccctga ttatcattta ggcatgtaag tgacacccag tagcattgct 420 

ttaattctgc tggtgacagt gccaaagctt tactatactc tttttgttgt ctgttgcttt 480 

tctcttgcta atttgcttga ctagataact aagaattcag gtaagcatta gctctttgtt 540 

cactgagaat aatacaactt gcaagataat taatttggat tgttctacat gtatttcgtt 600 

tatttctctt taccttgttc atttattacg acattttgaa ttatttacat acccatattt 660 

cttctttctt ttatggctca gctcactatg ctttttttta atactggtag cttcctcaag 720 

gttggaaaac aagatctgaa tactatagaa aataataact atttttctgt ggtcatatta 780 

aagatataat ggctttggat tttggggtga tttttctact gtcagtttaa aaaaaacttg 840 

tctatttgca tttgtgtgtt attacttcta gttaagagta tttccaagga aagtttcatg 900 

ttacttattt tgtttccatg tctttttcca aaagaactta ttttttatat tataataaat 960 

atcagtggaa aagtaggttt cgttatatag aaattaactt taggctgggt gcagtggctc 1020 

aagcctatat ttgggaggcc gaggcaggag gattgcttga actcaggagt tcgaaactag 1080 

cgtgggcaat gtagcgagac ctggtctcta caaaaaaaaa aaaaaaaaaa 1130 



<210> 19 
<211> 883
<212> DNA
<213> Homo sapiens

<220>

<221> SITE
<222> (19)

<223> n equals a,t,g, or c
<400> 19

gtcaccgtgg gcgtttaant atgatccccg gctcagattc gcagactgca ctgaacttcg 60

gctctacgtt gatgaagaag aagtctgatc ctgagggtcc cgcgctgctc ttccctgaga 120 

gtgaactttc catccggata ggtagagctg ggcttctttc agacaagagt gagaatggtg 180 

aggcatatca gagaaagaag gcggcagcca ctggccttcc agagggtcct gctgtccctg 240 

tgccttctcg agggaatctg gcacagcccg gcggcagcag ctggaggagg atcgcactgc 300 

tcatcttggc catcactata cacaacgttc cagagggtct cgctgttgga gttggatttg 360 

gggctataga aaagacggca tctgctacct ttgagagtgc caggaatttg gccattggaa 420 

tcgggatcca gaatttcccc gagggcctgg ctgtcagcct tcccttgcga ggggcaggct 480 

tctccacctg gagagctttc tggtatgggc agctgagcgg catggtggag cccctggccg 540 

gggtctttgg tgcctttgcc gtggtgctgg ctgagcccat cctgccctac gctctggcct 600 

ttgctgccgg tgccatggtc tacgtggtca tggacgacat catccccgaa gcccagatca 660 

gtggtaatgg gaaactggca tcctgggcct ccatcctggg atttgtagtg atgatgtcac 720 

tggacgttgg cctgggctag ggctgagacg cttcggaccc cgggaaaggc catacgaaga 780 

aacagcagtg gttggcttct atgggacaac aagcttcttt cttcacatta aaactttttt 840 

ccktcctctc ttcttcaaaa aaaaaaaaaa aaaaaaactc gag 883 



<210> 20 
<211> 989 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ctggcttggc tgctatactc ttgcccttca ctgaacctca gttttcctca tctgaatagt 60
tgggagactc attcctgcct ttctcatgtc cctggctatt tggtaaacca gccagtagga 120
agacatcgtg aaatgtatta aagtggtctt agctagacag agtgggcatg ccagggtcag 180
cagagattct gaagtctaga ccagttccct gggtgggccg ttgtcagtcc tagcagatgg 240
ccaggtcagc cctcaggctg gaaattttag ggcagctatt ggtaggtgtc tcctcttgct 300
gtgctgagat acggtcaaga tcatacttag gcttttgttg gaagaacata caagacgaga 360
gaaaaaaaaa gatcatactt aggggctccc ggaatttgct ctgccctagg ttgctgagac 420
ctctagaacc tgtgcaggct aaaggaactc agtcggtaga tccgagagag gtggtcaggg 480
agaccaggag catgtctaca ctgccagcag acttttgcct cctcccccaa gccagcagga 540
tggcccaaaa aggctccccc agcagatcat ctttgcagct ccttttttag ctccagtggc 600
agcagggatg aggaagggaa agttctatca tttttttcta atttaaaatg acatttaaaa 660
tcactagcct agtgggggcc aggtgtggtg gctcatgcct gttgtcccag cactttggga 720
ggccaagacg agtggatcgc ttgagctcag gagttaaaga ccagtctggg caacatagcg 780
aaacgccgtc tctataaaaa aatacaaaaa ttagctggat gtggtggtgc acacctgtat 840
tcttagctgc ttgtggggct aaggcgaaag gatcacttga gcccaggagg tcaaggctgt 900
agtgagctgt ttgtgccact gcactctagc ctgggtgaca aagcaaaacc ctgtctcaaa 960
aaaaaaaaaa aaaaaaaaag ggcggccgc 989



<210> 21 
<211> 495 
<212> DNA 
<213> Homo sapiens



<400> 21 

ggtggaatgt agtgaaaacg agatgctgtc tctwagggac taaggaagct atctttcctc 60 

ttccgacatt tgctgactgg tcgatgctag atgatcttga gaccctgtgc tgaggactgg 120 

agaactacag gaaaggagca gtatctctga attatcgtgt ggaaggtcac ttgtctagcc 180 

cctgatgact gtgagcctct tcctcctgct tgccacctct cagtcacaag acggctgctg 240 

cgactcaggc tcatgtccca attcaaggca gcaagaaggc catggagcgg cacctgccag 300 

cagatgcccc tgcaggccat ctctccaggc tcaggaacct aaagaagaat ctacccagat 360

gtggtgctca cacctgtgat cccaccactt tgggaggctg aggcaagagg atcacttgag 420 

cccaggagtt ggagaccaac gtaggcaata cagcaagact cccatctcta caaaaaaaaa 480 

aaaaaaaaac tcgag 495 



<210> 22 
<211> 2317 
<212> DNA 

<213> Homo sapiens 



<400> 22 

ccctaaagag ttgaagaact aattggtcgg taaaaattgg atattgaatt cataagatgt 60 

taaaatggac tggatttttg gtagttttgg ttgcttttaa aaaaattagt gctagctttc 120 

aagtgaatta caaccttaaa tttgagattt cctttggtga accatggaag tttacccagt 180 

ggtaaggaga actgtaatgt ttaggattct gaataagtac tgtgttttaa tcacagctct 240 

taattcaaag cattgattta attccacgta gtctgttata ttcagaaaca taaaaacaag 300

tggaaaaagt atgaccccag tgtcggagat ggctatgtgt gcatgtatat acaaatagac 360

acgtatatgt gcatgttacc tttacgtaca tgtggaaaaa cagttctaat taagtcaata 420 

tgtgtcttgt atttgcatat attagagtat gattttccta atggtcgagg gcctttttgg 480 

tgggccgcag tttggattta tgtggtatgt tgatgaagac ttagtgaata gccacagtac 540 

tcagcttttc ctctcacaga ttaatctgcc agtttctccc actgtgtatt gtgtatatgt 600 

gatagaattt gagggggaaa ataacacacc agctaatgat gaaacgaact ggctctagtc 660 

tgtaaggtaa cgggaccagc ccaattatat aactgattga ggcattgcca tttttcactt 720 

aactctttga atcaagtcat aattttattt aataattttt ttgaggcaga gtttcgctct 780 

tgtcgctcag gctggagtgc agtggcccaa tcttggctca ctgcaacctc cgcctcccag 840 

gttcaaggga ttctcctgcc tcagcctcct gagtggctgg gattacaggc acccgccacc 900 

aagcctagct attttttttg tattttttag tagagacggg gtttcaccat gttggccggg 960 

ctggtctaca actcctgact tcaggtgatt caccggcctc ggcctcccac acagctggga 1020 

ttacaggtgt gagccaccgt gcccgacctg aatcaagata ttatttaaaa gaactgtttg 1080 

atgttattta tttattcatg tcttcaatag gtatttacat gtctgtcttc ggagatggtg 1140 

ttagaggttt tttttttttt ttttgagacc gagtcttgct ctgttgccca ggctggaggg 1200 

cagtggcgcg atctcggctc actgcaagct ccgcctcccg aattcacgcc attctcctgc 1260 

ctcagcctcc tgagtggctg ggactacagg cgcccgccac cacgcccggc taatattttg 1320 

catttttagt ataggtgggg tttcaccgtg ttagccagga tggtgtcgat ctcctgacct 1380 

cgtgattcgc ccgcttcggc ttcctaaagt gctgggatta caggcgtgag ccaccaagcc 1440 

cggccggtga tagagttttg aacaagacaa agctccttgc aaagctaacc ttgggaggtt 1500 

tggggaaaac cacaaagagg gtgacaagat aatttcagat agggctaagg gataggaaga 1560 

aaatgaagat gggacatgtt aagtaataat aatagatgac atttgaacac tgtgccagtc 1620 

actctgtcga ttgttttaca tataccttgt gacaacccta agaggtaggc accgttatta 1680 

cggacatttt acaggtgaaa cagggacaca ggaaaagtaa gtaacatgcc cagctgttga 1740 

atcaaggcag cccgggacca gagtccactc tctgagaaat ggcatttggg caggatcgag 1800 

aatgaggagg agatccagtg tggcagaggc cctggggagg aatgggctgg aagtattcga 1860 

gaaacccaaa agtaagagtg gcctgagtgg cccagagagg ttgatgggag gcaggagtca 1920 

gagcgtgtct accctcgtag gccattggag gcttacatgt ggaagggaca ggttctgatt 1980 

cctgtttaaa aagagggctg ctgtccgggc gcggtggctc acgcctgtag tcccagcact 2040 

ttgggaggcc gaggcgggcg gatcgcgagg tcgggagatc gagaccatcc tggctaacac 2100
cccgtctcta ctaaaaatac aaaaacaaaa ttagcccggc atggtggcgg gcgcctgtgg 2160
tctcagctgc tcgggaggct gaggcgaaag aatggcgtga acctggaagg cggagcttgc 2220
agtgagccgg gattgtgcca ctgcactcca gcttgggcga cagagcgaga ctccatctcc 2280
aaaaaggaat tcgatatcaa gcttatcgat accgtcg 2317



<210> 23 

<211> 1726 

<212> DNA 

<213> Homo sapiens 



<400> 23 

ctttttggct ctcattttga atttttcaag agctcatgtt ctttgtcttc attaaaaaaa 60
aaaagttctt atatcgtgta tgaatgtcat tcgggatatc tatacataca tgcacatacc 120
tcatttttat tgcatttcac tttattgcac tttgcaaagt gacatttttt acagattcaa 180
ggtttggcaa ccctatgtcc ataagtctgt cagcaccatt tttctaacat atgtgctcat 240
tttgcctctc tgtcacattt tttttttttc ctgagacagg gtcttgctct gtcacccagg 300
ctggaatgca gtggtgtaat tatgcctcac tgcggccttg acctcttggg ctcaagggat 360
cctcttgcct gcgcttcttg agtagctgag actacagatg tacaccacca cacacccagc 420
taagttttaa atttttttat agagatgggg tttccctatg ttgcccaagc tgctctcgaa 480
ctcctgggct taagtgatcc tcccacctca gcctttcaaa gtgctgggat tacaggcatg 540
agccacagca cctggtctct gtgtcacgtt ataataattc tgccaatatt ccagatttcg 600
tcattattaa atctgttatg gtgatctgtg atcagcgaac tctgatgtta ctgtctaatt 660
gctttgggat gcagtgaacc cgtcagagtc atccatgagg gttggaatca acttcttcca 720
aaatcctgtt aatcaagagt gaacttaatc gattaatgtt gtgtatgttc tgactgctcc 780
accaatctgt gggtccccta tcactctccc tctcctcagg cctccctatt ccctgagaca 840
caataatatt gaaattagac caattaataa ccctgcaatg tgaaaggaag aagttacatg 900
tctctcactt taaatcaaaa gctagaaatt attaagctta gtgaggaggg catttcgaaa 960
gctgagagag gctgaaagct aggccggttg tgccaaatag ctagccaagt tgtgaatgca 1020
taggaaaagt tcttgaagga aattaaaatt gttactccag tgaacacaca aatgttaagt 1080
aagcaaaaca gccttattgc ttatggaaag aaagtctgaa tggtctgaat agaagatcac 1140
atcagccaaa acatgtcctt aagccaaagc ctaatctata atagatcagg ccctaagtct 1200
cttccattcc ttgaaggcac agagaggtga agaagttgca gaagaaaagt tggaagctag 1260
ccaaccttgg tttgtgcagt ttaaggaaaa aagccatctc cataacatgg aagtgcaaga 1320
tgaagcagca ggcactagtg gggaagctgc agcaagttat acagaaaatc tagctaatga 1380
tgagggtggc tacactaaaa acagattttc aatggagaca aaacaccctt ctattggaag 1440
aagatgccct ctaaagcttt cataggtaga gagaggtcag tgcatgggct tgaaagaaca 1500
ggctgattct cttgctagag gccaatgaag ccagtgagtt taagttgaag ccagtgctaa 1560
tttatcattc tgaaaattgt agggccctta agcattatgc taaatctact ctgcctgtgc 1620
tctagaaatg gaatagcaaa gcacgaatga cagcacatct gtttacaaca tgatgtgctg 1680
aatattttaa gcctatattt gagacctact gctcaaaaaa aaaaaa 1726



<210> 24 

<211> 529 

<212> DNA 

<213> Homo sapiens 

<400> 24 

acgcgtccga ttacttacgt gctcctggct gggatggcac tgggcattca gaaaaggttc 60 

tccccggagg tgctgggcct gtgtgcaagc acagcgctgg tgtgggtggt gatggaggtg 120

ctggccctgc tcctgggcct ctacctggcc accgtgcgca gtgacctgag cacctttcac 180

ctgctggcct acagtggcta caaatacgtg ggaatgatcc tcagtgtgct cacggggctg 240

ctgttcggca gcgatggcta ctacgtggcg ctggcctgga cctcatcggc gctcatgtac 300

ttcattgtgc gctctttgcg gacagcagcc ctgggccccg acagcatggg gggccccgtc 360
ccccggcagc gtctccagct ctacctgact ctgggagctg cagccttcca gcccctcatc 420
atatactggc tgactttcca cctggtccgg tgaccccctg gccccagatg gcactgagtt 480 
tttcattcat tgaagatttg atttccttga aaaaaaaaaa aaaaaaaaa 529 



<210> 25 
<211> 1755 
<212> DNA 

<213> Homo sapiens 



<400> 25 

ggcacgagcc tcacagcgcc tctgctggag ttcctgctgg ccttgtactt cctctttgct 60 

gatgccatgc agctgaatga caagtggcag ggcttgtgct ggcccatgat ggacttcctg 120 

cgctgtgtca ccgcggccct catctacttt gctatctcca tcacggccat cgccaagtac 180 

tcggatgggg cttccaaagc cgctgggggg tctgtgcctg acactcgggc tgtttgtcca 240 

agcagatctg aaatgggccg tgagctgggg gcagcagcct cccgggagca gggagtcagc 300 

cctgtgatgc atcccatcca ccctgtccac aggtgtttgg cttctttgct accatcgtgt 360 

ttgcaactgr tttctacctg atctttaacg acgtggccaa attcctcaaa caaggggact 420 

ctgcagatga gaccacagcc cacaagacag aagaagagaa ttccgactcg gactctgact 480 

gaaggcctgc gggtgccttg gcaacctgag ccacacaggc ctccacccct gcgcctcaca 540 

ggggtcgctg gcgttggagc ggaggcctgg acttctgagt tgcagagggg gctgcggaca 600 

cagcaggccc cctacagcct caggttctgc ctgagcccag cctaccaggc ttgcccctca 660 

gctcagcact gttgaccacg ctgcgtatga gggcatcttg ggtatcccac tccttctccc 720 

catttctgtc ccacaggcct tcagcccttt aacgtctctg ccaaaaacca gcacaaggag 780 

acaaagcaga gccttgtctg tatctgggca gcaggtgttc catgctgcta ggtggcgggg 840 

gtcgggggtc ttctgtttca ctaacaggaa caaagacaga aaccatgaca gggctgcccc 900 

gccaggcccc ggtgggtttg tctgcacttg gtgctcctgc ccacaccagc cactttggtg 960 

acaatgaccc ttccaagaat ctttggttca aggagcacca gttccctctt cattcttgaa 1020

gcagggagaa attgaccttt gccttgtcgc ccaggaagtg gggctcggca cccataacta 1080 

acacctccca cccttggaaa ccatgtcttc tgggggtgag atgaccattc tgggtctaag 1140 

actgtttcaa agaagagctc atagactgac tggtccagaa gacagagggt acaacagtgg 1200 

catcacagtg acagtgtcat ggggagctgg gcgggcccag ccaaaccctc cttcttccta 1260 

gagcccagcc agcaggcagg agttcctgga ccctcaggac agtgaacttc cagacctcag 1320 

ggcaggtcta tgggccactg caggagatga gaccagcctt ctgtgttcac ctaacgattt 1380

atactgtgta tctgtctttg atggaatttt gtaacttttt atattttttt atgcaaaagc 1440 

agcttcttaa cagatggcat tttctgtgac tctaggcctc acaaaagagc cagagttctg 1500 

gacccatgtt tggagcattt gtagccttat tctcttgcgt gtgaatctct taccctgaaa 1560 

aaaagccata atgaattaag ccagactgac cacttgcttg gagtgtgtgc ttgaaaaaac 1620 

cagagcaata ctgttgggta ttgtatcagg cttcagtaca aactggtaac accaatgtgg 1680 

atcctgacag ctttcagttt tagcaaaaat acacgtgaaa tctgactacc atttaaaaaa 1740 

aaaaaaaaaa aaaaa 1755 



<210> 26 

<211> 1751 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1520) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1557)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1689) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1729) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1735) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1741) 

<223> n equals a,t,g, or c 



<400> 26 

gggtgcagcc tgatggcgca ggaggtagac acggcacagg gcgccgagat gcggcggggc 60

gcgggcgcgg ctcggggacg cgcttcctgg tgctgggccc tggcgctgct ttggctcgcg 120 

gtggttccgg gctggtcccg ggtctcgggc atcccctccc ggcgccactg gccggtgccc 180 

tacaagcgct ttgacttccg tccaaaacct gatccttatt gtcaagctaa gtatactttc 240 

tgtccaactg gctcacctat cccagttatg gagggtgatg atgacattga agtttttcga 300 

ttacaagccc cagtatggga atttaaatat ggagacctcc tgggacactt gaaaattatg 360 

catgatgcca ttggattcag aagtacatta actggcaaga actacacaat ggaatggtat 420 

gaacttttcc aacttggcaa ctgtacattt ccccatctcc gacctgaaat ggatgcccct 480 

ttctggtgta atcaaggcgc tgcctgcttt tttgagggaa ttgatgatgt tcactggaag 540 

gaaaatggga cattagttca agtagcaact atatcaggaa acatgttcaa ccaaatggca 600 

aagtgggtga aacaggacaa tgaaacagga atttattatg agacatggaa tgtaaaagcc 660 

agcccagaaa agggggcaga gacatggttt gattcctacg actgttccaa atttgtgtta 720 

aggaccttta acaagttggc tgaatttgga gcagagttca agaacataga amccaactat 780 

acargaatat ttctttacag tggagaacct acttatctgg gaaatgaaac atctgttttt 840 

gggccaacag gaaacaagac tcttggttta gccataaaaa gattttatta ccccttcaaa 900 

ccacatttgc caactaaaga atttctgttg agtctcttgc aaatttttga tgcagtgatt 960 

gtgcacaaac agttctattt gttttataat tttgaatatt ggtttttacc tatgaaattc 1020 

ccttttatta aaataacata tgaagaaatc cctttaccta tcagaaacaa aacactctct 1080 

ggtttataaa acaccttaat tctactgctc ttttttctcc aatcaccagc atctgttttt 1140 

cagggggtga ttttactttt gtgaattcct tagcctttct tccttggtgc ataaagttaa 1200 

aatgcacatc agcagaattg ctgcatatta acatctcagg actcttctct tgtaaagaag 1260 

ctgaaattcg tactatattg gccaaagtga gcgagttagg tgatcttggt ttcaatttcc 1320 

gagcctttgt taatatggag aattatggtt catatcagtt atgtaggacc tttggaccca 1380 

gggtcctaca gatagatatg gtgtgcccag attttaaaaa taccttcaaa aataaaaaat 1440 

acattcagtg acattttcat ggtgggagct cttctttctg atatggcagt tacacttttt 1500 

cacttaagtg ctttagtttn agactaactt tacaacttct ataacttttg ggaaccnagt 1560 

ttagtatagt ctgattacat tccattcacc taactttagg cattcggttt agacaccata 1620 

actggrgkgr atkgkgcytc cyagratgtg ggcaaatccc agtggttaac accatatttc 1680 

tgggctggng attttgggga ctagctaggt aaacgggctt ggtggttcnt ttaancatac 1740 

ntaaccacca c 1751 
<210> 27

<211> 1212 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gccaagcttg gcacgargtt ggtggcggcg tccggaggtg ctggtttgtt ctcggtgaac 60 

ggcgcgcggg gtctctcctg agtgcgagct acgggacctt cgccatgccg gggatggtac 120 

tcttcggccg gcgctgggcc atcgccagcg acgacttggt cttcccaggg ttcttcgagc 180 

tggtcgtgcg agtgctgtgg tggattggca ttctgacgtt gtatctcatg cacagaggaa 240 

agctggactg tgctggtgga gccttgctca gcagttactt gatcgtcctc atgattctcc 300 

tggcagttgt catatgtact gtgtcagcca tcatgtgtgt cagcatgaga ggaacgattt 360

gtaaccctgg accgcggaag tctatgtcta agctgcttta catccgcctg gcgctgtttt 420 

ttccagagat ggtctgggcc tctctggggg ctgcctgggt ggcagatggt gttcagtgcg 480 

acaggacagt tgtaaacggc atcatcgcaa ccgtcgtggt cagttggatc atcatcgctg 540 

ccacagtggt ttccattatc attgtctttg accctcttgg ggggaaaatg gctccatatt 600 

cctctgccgg ccccagccac ctggatagtc atgattcaag ccagttactt aatggcctca 660 

agacagcagc tacaagcgtg tgggaaacca gaatcaagct cttgtgctgt tgcattggga 720 

aagacgacca tactcgggtt gcttyttcga gtacggcaga gcttttctca acctactttt 780 

cagacacaga tctggtgccc agcgacattg cggcgggcct cgccctgctt catcagcaac 840 

aggacaatat caggaacaac caagacctgc ccaggtggtc tgccatgccc cagggagctc 900 

ccaggaagct gatctggatg cagaattaga aaactgccat cattacatgc agtttgcagc 960 

agcggcctat gggtggsccc tctacatcta cagaaacccc ctcacggggc tgtgcaggay 1020 

tggtggtgac tgaaattagc tggacatggt tgcacacacc tgtaatcaca gctactcggg 1080 

aggttgaggc gggagaatcg cttgaaccag ggagttggag gttgcagtga gtggagatca 1140

caccattgcc ctgcagccta agcaacagag caagattctg tctcaaaaaa aaaaaaaaaa 1200 

aaaaaactcg ag 1212 



<210> 28 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1105) 

<223> n equals a,t,g, or c



<400> 28 



ggcacgagca aacatccagg agtgtgcacc ggtcatgcaa ggtgttttgt ttggctttgt 60
ctggcttttt agttttttgt ggcaggagaa taaatctagt gcctctccct ccacattagc 120
caaaagtgga agtccctgtc cagtcagcat tccttggatg cctggtgtat tagtccgttt 180
tttcacactg ctataataaa aagaactgcc caagactggg taactaataa aggaaagagg 240
tttaattgac tcacacttct gcatgtttgg gaggcctcag gaaagttaca atcaggcaga 300
aggtgaagtg cgttcgtctt aatggcggca ggtgagacag tgtgtaggat aaactgtcaa 360
acacttataa aaccatcata gctcatgaga cctcattcac tgtcacgaga acagcatggg 420
ggaaccgccc ccatgatcta atcacctccc actaggtccc tccctccacc tgtggggatt 480
atgaggatta caattcaaga tgagatttgg gcaggggcac cgagccaaac catatcacct 540
tatatgtgcc cagtgttgac ctaggcgctg ggatgcagaa acaaacacga catgggctgt 600
gccttgggga gctcacactc ttgctggaga agcatgctga ttcctaaata agaaatgcta 660
tgtgctgtgt acagagtacc atggaaggca ggatgaactc tttgggagga agaagcaagg 720
aaagctttag agagttgctg gcttttgagg gatggagcag gcattttcta catggggaaa 780
gtgtaggaaa gagtattcca ggcagagtgg agagcaagag caaaggcgga gaagcctgtg 840
ctgcgaattc cttgccgggc aggatccctg tcttactgct gtttagagat caatatatgt 900

caagtgactg gaagtgtggt ttttgttctg ggactagtag gtagaacaga aagagttggg 960 

atggagtgag caacccatgg agaaatagag gctcggggtc agctgataca aggcgttgta 1020 

taccaagctg aggagcacaa gatttggaac ataataccaa atgctgggga gccatgggag 1080 

ggccatggga gctctgatag tgttntctcg ag 1112 



<210> 29 

<211> 748 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ggcacgagcg aaactgtttt ccaatgtggc tgaaccactc tgcatttcca ccagtaatga 60 

gaatgagagt tgctgttgct ccacggcctc accagcattt ggtggtgtca gtgtcttgga 120 

ttttagccat cctaataagt gttagtggct atcattgttt tcatttgcaa ttctcttaca 180 

tggtgtkgaa catctttccc catgtttatt tgtcatctgc atatcttctt cggccagtta 240 

tctgttcaga tcttttgccc gtttttgttt gcttgcatgt ttgtttgtgt ttgatttttt 300 

aaagaaagct ttttttatta ttgagttgta atagtgcttg tatagtgtgg ataacagttc 360

tctatcagat aggtcttttg caaatatttt ccccaatctg tggactgtct tctcattctt 420 

ttgataaatg gctttaaaat aataatctgg ccgggcgcag tggctcatgc ctgtaattcc 480 

agcactttgg gaggccaagg gcagatcatc tgaggtcggg agttcgagac cagcctgacc 540 

aacatggaga aaccccatct ctactaaaaa tataaaatta gtcgggcgtg gaggcacatg 600 

cctgtaatcc cagctacttg agaggctgag acaggagaat ctcttgaacc cgggaggtgg 660 

aggttgcagt gagccgaaat cgtgccactg tattccagcc tggacaataa gagcaaaact 720 

ccatctcaaa aaaaaaaaaa aactcgag 748 



<210> 30 

<211> 778 

<212> DNA 

<213> Homo sapiens 

<400> 30 

ggaactaaaa agctttgtgt tcttcagggt gggtggcagg gggatatagt gagggtggac 60 

cagggagaat gaccataggg cactaagtaa ggctgggatt ggatcagcag aaatccaacc 120 

ctctaacctt agggtaggga gtgctaagga tctggggaaa ccatgggctg ggaagctgct 180 

cttgctctcc tcgtgtctgc tgtttttttc ccttggtgta ctatacagag gccagatgtt 240 

ggcaccacct ctccaggagg attggaaagg aggagtaaag gattctgatt tgattgatga 300 

ttccagtgca tccccaatcc caccatctta cctggaatat aaggctgcct tgtacccctt 360 

ttctgagcac aagtctgtgc gtaatgcaac tgactctctt acttttttct tagtaactga 420 

tcatttccta gacaaccaag attctcaata agtcccagtc tcatcacaaa tattaatatt 480 

tccttttcct cataccaact tgactatgtt tcactgaaac ccacaggtct tgggacagaa 540 

tgaggcatta cctcattgaa ctttagctgc ctgcatgagt cctctgtcct caagtctttc 600 

tcagatcatt tctcaagctg gctcccagct tagggcaaag agaatctcca tgatgtgctg 660 

acttctagct tgccacagac acaattctac tccaaagtca gcctggcata gtaacattga 720 

tgtcagggga gacatatcag tttgaggcca tacaaaaaaa aaaaaaaaaa aactcgag 778 



<210> 31 
<211> 1324 
<212> DNA 

<213> Homo sapiens 



<400> 31 
acgagctaat gattcttgct gaagatggcc agagtaatca gagtaattaa tttggggaat 60

ggtgaaggat aaggaacctg atcagatatc aagggtgggg gtactcttgc taaactgact 120 

cagtagggtc cttgctaaaa ctggttttta caagagagag cacaggtagg tctacaagaa 180 

gattcgggag actgactaaa gtttcatcaa gaatctttgt caggagacat gacctttatg 240 

atacttaagt ttttcttttt atgcggtttt gttttaaaca ggctaatagc tcgtcagctt 300 

gctaaaatcc atgctattca tgcacacaat ggctggatcc ccaaatctaa tctttggcta 360 

aagatgggaa agtatttctc tctcattccc acaggatttg cagatgaaga cattaataaa 420 

aggtaaaatt atttttacat ttgaaattat gttttacatt gctttgctct atgatagggt 480 

ttcaaaggta attgtaaagt ttcattgtat aaaatctggt tttctttctt tgcattgaag 540 

taaagtaagc attgattctt tggctgtcag atagcactac agaaataact gcctctccat 600 

ccccctcagt gtcccctccc aaaaaatatg ccctacaaca gcaaggggca gaagtggaag 660 

taggggaaca ccacttaaaa ataaagaggt aggtggtaat ggtgaagcaa ggatttttct 720 

tgacttttta agatactgca gggttgagag gcaagtctag tattatatag aataagagca 780 

caagagcctg gagccaaact gagattaaat cttagtcctg ctagttaaca tttctgttgt 840 

gtaactcatt caactaggga acttaaccta actgtttcct tatctataaa atggaaatta 900 

cagtagtagt atattaacca tatagagttt ttgtgaggat tgagatagta tatgtaaagt 960 

tctttaaaac agtgcctggt catcacttaa gtgttgaggt agctgctgtt ttaaaaatta 1020 

ctattgttat tcaaagaagg ttatttgagt tatatttttt ccctaggctc tccaaggtat 1080 

actttaaatc cttgaggtta atgattcttt ggaaaagctg gaggtgtgct gtggtaaata 1140 

ataggaagaa aacccttgcc ccaatagaaa ataatacaac tagaacataa aacacaatta 1200 

aaatattaaa tacattttgt atttctttta ttccattttg tttgttcatg atttaagttt 1260 

tataatcctt cgaatcccac aattttcatt cgacactacc tttaaaaaaa aaaaaaaaaa 1320 

aaaa 1324 



<210> 32 

<211> 739 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (732) 

<223> n equals a,t,g, or c 



<400> 32 

ggcacgagga caggatcctg gtttgggtac cttagtttaa tagaaaccca ggtggaaacc 60 

tagattgcag ggcatttgtt tgagactatt tagccacagc agggcaagca ggaagatgca 120 

gcctgtcact gctaactcct tagtattaaa actgtcaaac atgggaggta actgctgatg 180 

catttttcag ttgatagttt atatactttc tctgaaggat cctaatgata gttaaccatt 240 

tctcattttt attttgctgg attgttttct gttttttgct tcagcattct tgcttttgct 300 

gtgcttactt ttggagtttt gattccctgt gtcactgttt tctttcgcat acacctctca 360 

ggtttacaca gtaaacaatg tgaatgtgat caccaaaata cgcacagaac atctgaccga 420 

ggaggaaaaa aagagatata aaggtaatca ccaccaccct cccacctcct gttttgttgt 480 

tattttttaa gcctagaggg aactctttgt tggctctgtt aagtttaggg ttaatgtgat 540 

tgggttgtgt taagcctaac cctaacttct wctctctctc tctctttttt tttttttttg 600 

aggcaaagtc tcgccctgtc acccaggctg gagtgcagtg gcacgacctt ggctcactgc 660 

aacctctgcc tcctgaggca ggagaatcac ttgaacctgg caggcggagg ttttggtgag 720 

ctgaggtcgt gncattgca 739 



<210> 33 

<211> 1462 

<212> DNA 

<213> Homo sapiens 
<400> 33

ggccatcggc ggggcagtcg cgggatgcgc ccgggagcca cagcctgagc tttagcccat 60 

gaggaggatg tgaccgggac tgagtcagga gccctctgga agcatggaga ctgtggtgat 120 

tgttgccata ggtgtgctgg ccaccatctt tctggcttcg tttgcagcct tggtgctggt 180 

ttgcaggcag cgctactgcc ggccgcgaga cctgctgcag cgctatgatt ctaagcccat 240 

tgtggacctc attggtgcca tggagaccca gtctgagccc tctgagttag aactggacga 300 

tgtcgttatc accaaccccc acattgaggc cattctggag aatgaagact ggatcgaaga 360

tgcctcgggt ctcatgtccc actgcattgc catcttgaag atttgtcaca ctctgacaga 420 

gaagcttgtt gccatgacaa tgggctctgg ggccaagatg aagacttcag ccagtgtcag 480 

cgacatcatt gtggtggcca agcggatcag ccccagggtg gatgatgttg tgaagtcgat 540 

gtaccctccg ttggacccca aactcctgga cgcacggacg actgccctgc tcctgtctgt 600 

cagtcacctg gtgctggtga caaggaatgc ctgccatctg acgggaggcc tggactggat 660 

tgaccagtct ctgtcggctg ctgaggagca tttggaagtc cttcgagaag cagccctagc 720 

ttctgagcca gataaaggcc tcccaggccc tgaaggcttc ctgcaggagc agtctgcaat 780

ttagtgccta caggccagca gctagccatg aaggcccctg ccgccatccc tggatggctc 840 

agcttagcct tctacttttt cctatagagt tagttgttct ccayggctgg agagttcagc 900 

tgtgtgtgca tagtaaagca ggagatcccc gtcagtttat gcctcttttg cagttgcaaa 960 

ctgtggctgg tgagtggcag tctaatacta cagttagggg agatgccatt cactctctgc 1020 

aagaggagta ttgaaaactg gtggactgtc agctttattt agctcaccta gtgttttcaa 1080 

gaaaattgag ccaccgtcta agaaatcaag aggtttcaca ttaaaattag aatttctggc 1140 

ctctctcgat cggtcagaat gtgtggcaat tctgatctgc attttcagaa gaggacaatc 1200 

aattgaaact aagtaggggt ttcttctttt ggcaagactt gtactctctc acctggcctg 1260 

tttcatttat ttgtattatc tgcctggtcc ctgaggcgtc tgggtctctc ctctcccttg 1320 

caggtttggg tttgaagctg aggaactaca aagttgatga tttctttttt atctttatgc 1380 

ctgcaatttt acctagctac cactaggtgg atagtaaatt tatacttatg tttccctcaa 1440

aaaaaaaaaa aaaaaactcg ag 1462 



<210> 34 

<211> 2815 

<212> DNA 

<213> Homo sapiens 

<400> 34 

gggtcctgga gtgccctcgg ctgatagaga ctatagttcg agagttcttg cccaccagtt 60 

ggtctcctgt gggggcaggg cctaccccta gtctatacaa agtaccctgt gctactgcca 120 

tgaaactact tcgtgtcctg gcctcagctg ggaggaatat tgctgcccgg ctgttgagca 180 

gctttgatct ccggagccgc ctgtgccgca tcatagctga ggctccccaa gaactggcct 240 

tgcccccaga ggaagctgag atgctgagca ccgaggccct ccgtctgtgg gctgtggctg 300 

cctcctatgg ccagggcggt tacctttaca gggagctcta cccagtgctg atgcgggcct 360 

tgcaggtggt gccgcgggag ctcagcaccc acccacctca acccctgtcc atgcagcgga 420 

tagcctcact gctcactctc ctcacccagc taaccctggc agccggcagt acccctgctg 480 

aaaccatcag tgattctgct gaggccagcc tctcggccac cccttcctta gtcacttgga 540 

cacaggtgtc tgggctccag cctcttgttg agccgtgtct aaggcagacc ttgaagttgc 600 

tgtccagacc tgagatgtgg agagccgtgg gcccagtgcc cgttgcctgc ctgttgttcc 660 

tgggagccta ctaccaggcc tggagccagc aaccaagctc atgcccggag gattggctcc 720 

aggacatgga gcgcctgtca gagagctgct gctgccactg ctgagtcagc ccacactggg 780 

cagcctgtgg gattccctta ggcactgctc ccttctctgc aacccgctgt cctgtgtgcc 840 

agcccttgaa gctcccccca gcctcgtgtc actgggctgc tcgggaggct gcccccgtct 900 

cagtctggct ggctcagcct cacccttccc attcctcact gccctcctct ctcttcttaa 960 

taccctggcc cagatccaca aggggctgtg tggccagctg gctgccatat tggctgcccc 1020 

gggactccag aattacttcc tccagtgtgt ggctcctggg gctgccccac acctcacacc 1080 

tttctctgca tgggccctgc gccatgagta ccacctgcag tacctggcac tcgctctggc 1140

ccagaaagcg gcagcgctgc agccactgcc agccacccat gctgccctct atcatggtat 1200

ggccttggcc ctgctgagcc ggctgctgcc cggaagtgag tacctcaccc atgagctgct 1260

gctgagctgt gtattccggc tggagttcct cccggaaaga acatcagggg gtccagaggc 1320 

agccgacttc tctgaccagc tgtcgttagg aagcagcaga gtccctcggt gtgggcaagg 1380 

gactctgctg gctcaggcct gccaggacct ccccagcatc cgcaactgct acctgactca 1440 

ttgctcgcca gcccgagcca gtctgctggc ctcccaggct ctgcaccgag gggagctaca 1500 

gcgagtccca accctgctac tgcccatgcc tacggagccg ctgctgccca ccgactggcc 1560 

cttcctgcac tgattcgcct ctacaccggg cttcagacac cccctcggga ctctctccac 1620 

agacaccatg ggcacagcca tgcgggtcct gcagtgggtg ctagttttgg agagctggcg 1680 

cccccaggct ctctgggctg tgccccctgc tgcccgcctg gcacggctca tgtgtgtgtt 1740 

cctggtggac agtgagctgt tccgggagtc cccagtacag catctggtgg cagccctcct 1800 

cgcccagctc tgtcagcctc aagtcttgcc aaacctcaac ctggactgcc gactccctgg 1860 

cctgacgtct ttccctgacc tctatgccaa cttcctggat cattttgagg ctgtctcttt 1920 

tggggaccac ctctttgggg ccctggtcct cctgcccctg cagcgtcggt tcagtgtcac 1980 

cttgcgcctt gccctctttg gggaacacgt gggagccttg cgagctctga gcctgcctct 2040 

gacccagttg cctgtgtccc tggagtgtta cacagtgcct cctgaagaca acctggccct 2100 

ccttcagctc tacttccgga ccctggttac tggtgcgctc cgcccacgtt ggtgccccgt 2160 

gctatatgct gtggctgtgg ctcatgtcaa tagcttcatc ttctctcagg acccacagag 2220 

ctcagatgag gtcaaagctg cccgcaggag tatgctgcag aaaacatggc tgctggcaga 2280 

tgagggtctc cggcagcacc tcctgcacta taagcttccc aattccacgc tcccagaggg 2340 

ctttgagctc tattctcagt tgccccctct gcgtcagcac tacctccaga gactgacttc 2400 

aacagtgctc caaaatgggg tatcagagac ctaggatagt tgatatagat ggaaagatgg 2460 

gtacgttgtc ctgtatccag cctttcaaca gatgtctggc cagacgaaga acattgtgtc 2520

ctaatggtag gcaggagacc aaggagcaga aggcttgcct tcctgggagc aggttgtttg 2580 

agctgtttta gagcagtgag ccctaccatt acatcctgat atctggggct tctgaaggtc 2640 

tgtgctggga gtgaagagtg gcttagctat ttacccgctc tttggggaca gggcaaacta 2700 

aatgcatccc ttcttaccta actcccaacc cctgccctgg gctgaggcat atgaatgcta 2760 

tagttgtgca ttaaaataaa tgttttttat ctcctggaaa aaaaaaaaaa aaaaa 2815 



<210> 35 

<211> 1078 

<212> DNA 

<213> Homo sapiens 



<400> 35 

ggtgggctct gtgctgggtg ccttcctcac cttcccaggc ctgcggctgg cccagaccca 60 

ccgggacgca ctgaccatgt cggaggacag acccatgctg cagttcctcc tgcacaccag 120 

cttcctgtct cccctgttca tcctgtggct ctggacaaag cccattgcac gggacttcct 180 

gcaccagccg ccgtttgggg agacgcgttt ctccctgctg tccgattctg ccttcgactc 240 

tgggcgcctc tggttgctgg tggtgctgtg cctgctgcgg ctggcggtga cccggcccca 300 

cctgcaggcc tacctgtgcc tggccaaggc ccgggtggag cagctgcgaa gggaggctgg 360 

ccgcatcgaa gcccgtgaaa tccagcagag ggtggtccga gtctactgct atgtgaccgt 420 

ggtgagcttg cagtacctga cgccgctcat cctcaccctc aactgcacac ttctgctcaa 480 

gacgctggga ggctattcct ggggcytggg cccagctcct ctactatccc cccgacccat 540 

cctcagccag cgctgccccc atcggctctg gggaggacga agtccagcag actgcagcgc 600 

ggattgccgg ggcyctgggt ggcctgctta ctcccctctt cctccgtggc gtcctggcct 660 

acctcrtctg gtggacggct gcctgccagc tgctcgccag ccttttcggc ctctacttcc 720 

accagcactt ggcaggctcc tagctgcctg cagaccctcc tggggccctg aggtctgttc 780 

ctggggcagc gggacactag cctgccccct ctgtttgcgc ccccgtgtcc ccagctgcaa 840 

ggtggggccg gactccccgg cgttcccttc accacagtgc ctgacccgcg gccccccttg 900 

gacgccgagt ttctgcctca gaactgtctc tcctgggccc agcagcatga gggtcccgag 960 

gccattgtct ccgaagcgta tgtgccaggt ttgagtggcr agggtgatgc tggctgctct 1020

tctgaacaaa taaaggagca tgccgatttt taaaaaaaaa aaaaaaaaaa aaaaaaaa 1078
<210> 36
<211> 1217 
<212> DNA 
<213> Homo sapiens 

<400> 36 

cggcacgagg ttgaatgtta gccctggagg agatccatgt cttactcgct ctttctggcc 60
cttctgtctt ttgcctctgc aattcttttt gtagctggca cgatagcagg gactgggggt 120
ctatcctttc atggtattgc tacaatattt gtccttactg gaaaatggta acatccgggt 180
ctgatttaat tggcattaca cttacacagg gactctgagc acccccgtca ccacaccaga 240
cagtggacca gttttcacag ctacaaagag ctagaaatgt gtttaacatc atccagtgca 300
tcccctaatt caaaaccatc ctcactaatc aatcatattc acccataaat attacaaatg 360
agattgattc catctcaaga caatttgtca aatacttaat tttcttcctg gatgattcta 420
cttactggat attttagaaa gagaaatgtc tgagataaaa tccctcacat ttactcaata 480
taacaaatta ctgtttctac tcctattctg agtagtgctt ctgaagattg tttgctgtag 540
tgttgtcttt gataaaatga atgtcagtag tgagcctttt agagatacca tgctcagaaa 600
tcctctttgg gatcagaaga tacctaaaat tctccccttt tgcccacttg gttagatgag 660
tgatatattc tttggatcct gcaaagaaga gattggtttc ttttcttttc tggtggtggt 720
agtggttgta tctgtggctg tgatggttgt tgttacttgt ctctctctct ctctggctct 780
ggcttttgct ttcctgctag tgttctttct ctttccaaac aaatagttaa attaaacgtg 840
agcttctgaa ttgtacttgt tcatactttc aaaacataac agattaataa aaatagatgt 900
gtcctgattt aaaacatgcc ccctggaaag gcatgctgta ttatgaaatc atgataatat 960
aactgcatta ttacatggca gtataaatat tagtctgttg aattcatttg tccaattgta 1020
taactttgtg gagcagtgtt ttgacctttg atacataatt ctggagcaag tggagtggtt 1080
gcaggcagat gagacagtgt tatatcagga tttttcaatc aactttagtt ggaggcctgg 1140
caattacaaa catcttcaga tgtttctgta accattataa atatgaaaaa aacctcttca 1200
aaaaaaaaaa aaaaaaa 1217



<210> 37 

<211> 1282 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (153) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (1220) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1222) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1232) 

<223> n equals a,t,g, or c 



<220> 
<221> SITE
<222> (1246) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1282) 

<223> n equals a,t,g, or c 



<400> 37 

actcgtgccg aattcggcac gagccattct gagtttggtc ccttcccaaa agtaggggtt 60 

ttgtgtggaa aatctgagca aacctctgtt gactgttctg gggtggagtg aagggagaar 120 

gggctcagct aaagaacatg gggagattag ggnaacaatg ccttttattt cttgctttta 180 

aagcaatttc aggagttttc ttcctctttt ggcgtcctgc tgactccaca gagcggaaca 240 

cccaaagctg ggactttcca cctctctaat gctcagtgaa gagcgggcca ggggggtgtg 300 

gaaaagaaag ggtcctggag gagcccaaat tacgaatggc tagagactgg cattggcaag 360

cgaggaggct tcgtcacagt gtagtcttcc ggttgtccga gggtactgtc ccaggggctg 420 

gggggtrttc cgtcttctgc agatcaactc ccgcaggcta aatgtggaca tcgcggtatc 480 

atgcttgata aacggaccaa taatcaagtg gagattcatt agaaccacat aacccatact 540 

aggttgattt ctcaagtata agscctggtc tgttgcccag sctggagtgc actgacacca 600 

tcatggttca ctgcagcctc aaactcctga gcccaagtga tycctcccac tcagcctcac 660 

aagtagctaa gactagaggt gtgcaccatc amacccagct aatttttaaa gttttttttg 720 

tararatggg gtcccactct acaaaatatw taagtataag gcckggtctg ttgcccargc 780 

tgggsaaccc ttggactaag gcaatcctcc agcctcagcc tcctaaagtg ctgggattac 840 

aggcgtgagc caccgcaccc acctctagga tctctactat tgaggaaaaa ttggaggcat 900 

caaactccaa gggcaaaaca tgaagactcg ctggcccacc atggatggag gttttctctc 960 

ttaaaattcc cacagcaccg catggaactg cctctcctgg gacctcagcg tttccttctt 1020 

tgctctaagc aatagcctct gccactggag attctgagat ggccgatttc cttttggata 1080 

tttaagtttt gaaatcatgc tcatttggca taggaatgtt tcacttcagt ctcctttaaa 1140 

caaaaggaca cacaaccacg attgcccctc cctcccgaag ggtcactgga cttcatgcat 1200 

cagtaatgtt tccaaaaatn tnttaagtac cnacatgcag tggccngctt ttcatttttc 1260 

caagtgaagc catcagaaaa an 1282 



<210> 38 

<211> 559 

<212> DNA 

<213> Homo sapiens 



<400> 38 

gattcggcac gagctgaagc cctgggtgcc actgctggcc cagcagggag gaggttgctg 60 

ctgctcgggc tgaagtgagg tgtgggtctg gctgggcctc cagtttccca cctgggcctt 120 

gattgtgagg aaggcctggc ctggctgcag aagcccagaa gcacctgagt aggagagttc 180 

ctttgtccca cctgcagctc attcaagcct gtgcatgggg gttggggtcc tcaggatctt 240 

gctttcctgt ttaggggagg cagccccaaa gagtgctggg accagtttgg agagtgctaa 300 

ggaatgctgg tctgcagcga ccctacttgt gctctgcgtc ctctgccaac tgcagcatgg 360 

gtgaacatct gtacatctgt ccccataatg aaaatggcct cagcaaataa caaaaatatt 420 

accatttagc aatcaggcac ttattaaaag cctggcccaa taaacttaaa aaaaaaaaaa 480 

aaaaactcga gggggggccc ggtacccaat tcgccctata gtgagtcgta ttacgcgsgs 540 

tcamtggccg tcgtttaca 559 



<210> 39 
<211> 803 
<212> DNA 
<213> Homo sapiens
<400> 39 

ggcagagcta ggccaggcag agcctagctc ttgccagggc agcaggaagc cacacagtgt 60 

gttgaagccg gagcaggaga gggggccctg actcccatgt gtccttgcag gcaggagcag 120 

ttcgtggact tgtacaagga gtttgagcca agcctggtca acagcaccgt ctacatcatg 180 

gccatggcca tccagatggc acctttcgcc atcaattaca aagtaaggcc tgggccctgc 240 

craaaacattc actgtctgcc cacccagccc caccccatga agccatctgt ccctcatccc 300 

cacagggccc gcccttcatg gagagcctgc ccgagaacaa gcccctggtg tggagtctgg 360 

cagtttcact cctggccatc attggcctgc tcctcggctc ctcgcccgac ttcaacagcc 420 

agtttggcct cgtggacatc cctgtggagt tcaagctggt cattgcccag gtcctgctcc 480 

tggacttctg cctggcgctc ctggccgacc gcgtcctgca gttcttcctg gggaccccga 540 

agctgaaagt gccttcctga gatggcagtg ctggtaccca ctgcccaccc tggctgccgc 600 

tgggcgggaa ccccaacagg gccccgggag ggaaccctgc ccccaacccc ccacagcaag 660 

gctgtacagt ctcgcccttg gaagactgag ctgggacccc cacagccatc cgctggcttg 720 

gccagcagaa ccagccccaa gccagcacct ttggtaaata aagcagcatc tgagatttta 780 

aaaaaaaaaa aaaaaaactc gag 803 



<210> 40 
<211> 1510 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (426) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 
<400> 40 

cacgagaaac attctatctt ttatcaaatg tgtgattcat aacttttgga taccaaagga 60
atctaacgaa ataaccataa tcatcaatcc atacagggag actgtgtgct tctctgtgga 120
gcctgtcaag aagatattta actatatgat acatgtgaat cgaaacatca tggatttcaa 180
actcttcctt gtgtttgtgg caggagtttt tcttttcttt tatgcaagga ccctggagtc 240
aaagccctac tttctattac tcctcgggaa ctgtgctagg tgttctaatg acatagtctt 300
tgtcttgctg ttggtgaaaa gattcatccg aagtatagca ccttttgggg ctctaatggt 360
tggttgttgg tttgcctcag tttatattgt atgccagttg atggaagatc tgaagtggct 420
gtggtntgaa aacaggatat atgtatcagg ctangtcttg atagttggat ttttcagctt 480
tgttgtttgt tacaagcatg ggccccttgc acacgacagg agcagaagtc ttctgatgtg 540
gatgctgcga ctcctctccc tggttctggt ctatgctggt gtggctgtgc ctcagtttgc 600
ctatgcagcc ataatcctcc tcatgtcctc ctggagtctg cactacccac tgagagcatg 660
cagttatatg aggtggaaaa tggagcagtg gtttacatca aaagagctgg tggtgaaata 720
tcttacggaa gacgagtaca gggagcaagc tgatgctgaa acgaacagtg ctctggagga 780
gctacgccgg gcctgccgaa aacccgactt tccctcatgg ctggtcgtct ccagactcca 840
cactcctagc aaatttgcag attttgttct tggaggaagc cacttgtcac ctgaagaaat 900
cagtctgcat gaagagcagt atggccttgg gggtgccttc ttggaagagc agctctttaa 960
cccgagtact gcctgacatg cgaccttcaa gttgacttca ttctggacaa ggaagtgggc 1020
aaagggcagg attctattaa agttaggcag aactgttcta gtgaacggtg gcaaaaacat 1080
ttgctgtgga gaaaaacaag tcagtctgga aaggaaaacc aacccatttt gaagataact 1140
tagcattctt ggtgacttct gctacttatt gtactgtagg tggataccaa aattctgtga 1200
cagccactac cacttacctt gaatgaaggc tttcattagg aacaggggaa tggcgttgtt 1260
cttaaggggc tagtaagcat gaacaggtgc tttgtcgaca ccagggcact aaatctggtc 1320
ttaatcccct gaacctgtgt cagaagactc tgcaatactc ttcctatagt tcgtcagtat 1380
aagtccttaa agagacctga gacatgctgg accagtgttt tccaaagtac agctcacagg 1440
ctactaccaa gtgttggtca ataaaggtat tctgaggtca actaagattg ataaaaaaaa 1500
aaaaaaaaaa 1510



<210> 41 
<211> 1095 
<212> DNA 

<213> Homo sapiens 
<400> 41 

gcttggtggt gctatttgct tcttcaaatt ctcgttattt aaaatatttc tttcttgtac 60 

cgttgattct gggatcagcc tggatgtgtc aaacactgcc tgccaggctt agagctcagt 120 

gcatttcttc ccttttattc ctgctgatgg gattgctggc catgaccggt gagaggaatc 180 

aaggaaccca ttactatgag ttctcaggat tcatcttcaa atctcaaatg atgtggtcaa 240 

ttaaaccaaa ttaaaaacaa gctcttgtta aaagcaagtt aaaaacaagc tcttgacctt 300

gagaagaaat gattggtatt aggaagactg ttgagctgat actgcccttc attcattctc 360 

taccctggtg cttggataca ggagcaaagt aagaaaataa tcacagcttt attgagggct 420 

ctatgagcaa ggcttggtga ggatggaaga gaatggagct atcagttgat gagaacctac 480 

taggtgttga gctccttaca ttcattgcct atttaaaact ttctaacaac ttcatgtgta 540 

agcgttgtcc cgatttaaaa aaaaaaatag atgtggaaac tgaacctgga gaaggtgtgt 600 

aatttgtcca aggttgcaca ggcaaagggg caaaattcag ctttaaaccc aggactgttt 660 

ccacagctcc aagtyccctt tattcatggg atttgtaaga tggagcccct gccactgtag 720 

catttataac ttactttgga gaataagatt cctgaaagta cgtttaataa aaaaaaaaga 780 

tgtccagcta tgtacggcag ctcacgcctg taatcccagc actttgagag gcaaaggggg 840 

gaggatagct tgaggctaag agtttaagac taacctgggc aacatggcaa gaccctgtct 900 

ctaaaaaaca aaattagcca gttgtggtgg catgcacctg tagtccaagc tactcaggag 960 

gctaaggtga gagggtcgct tgagcccagg agtttgaggc tgcagtgagc catgatggcg 1020 

ccactgcact ccagtgcaga gtgcaggcta cagaatgaga ccccatcaca aaaaaaaaaa 1080 

aaaaaaaaac tcgag 1095 



<210> 42 
<211> 1162 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (340) 

<223> n equals a,t,g, or c 



<400> 42 



ggcacgagct gattcctaag gaatattcta gccaaatcat gtatctgtgg tttagttttt 60
ctacagtagg gctgtgcggt tgctgcctgc tttatagggc atgtgggttt atatggtatc 120
tgctgttact tgggcacagc agcaccaact cattacagga tggaggggca gaacgcccag 180
agcacccctg ggctcacgtg cggtacagct gcaggagaga gctgtccttt tggttttatg 240
tttttaatta attctgtttc ctcagattga tgattaaatt tatttttcca gcctgaccaa 300
gaaggcgtca ccataccaga tctggggagt ctctcctcan ctctgataga cacagagagg 360
aatctgggcc tgcttctcgg attacacgct tcctatttag caatgagcac accgctgtct 420
cctgtcgaga ttgaatgtgc cagtaagaaa atctttactt tttgctaatt agcagatttt 480
ttttttttgg aactgtaagt gccattaaga gtgggagagg gccaggcaca gtggttcatg 540
cctgtaatcc cagcactttg ggaggttgtg gcacgtggat tgcttgagat caagattttg 600
agaccagcct gggcaacatg gcaaaacccc atctctacaa aaaacacaaa aattagccag 660
gcatgttggc acgtatttgt agtcccagat actcaggagg ctgaggtagg aggattgctt 720
gagcctggga ggttgaggct gcagtgagtc atgatcatac cactgcactc cagcctgggt 780
gacagagcaa gactctctct ttaaaaaagc aggagatggc caggcagtgg ctcatgcctg 840
taatcccagc actttgggag gctgaggcgg gtggatcacc tgaggtcagg agttcaagac 900
cagcctggcc aatgtggtga aaccccatgt ctactaaaaa tgcaaaaatt agctgggtgt 960
ggtgacgggt gcctgtaatc ctaggtactc gggacgctga cgtaggagaa ttgcttgaac 1020
ccaggacaca gaggttgcag tgagctgaga tcacgccact gcactccagc ctgggtgaca 1080
agagcgagac tcggtctcca aaaaaaaaaa aggagaggag gattcaacac agttgatgat 1140
gacaaaaaaa aaaaaaaaaa aa 1162



<210> 43 
<211> 657 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<400> 43 

cccccccggg gntgcaggaa ttcggcacar attttacatg cttttaagtt aatgttggaa 60
aactaatcac aagcagtttc taaaccaaaa aatgacatgt tgtaaaagga caataaacgt 120
tgggtcaaaa tggagcctga gtcctgggcc ctgtgcctgc ttcttttcct gggaacagcc 180
ttgggctacc caccactccc aaggcattct tccaaatgtg aaatcctgga agtaagattg 240
caccttcttc ctctcctgat caacatcggt atgatgtctc ctgttgcctc accctttgtc 300
tgcagtatca ctggatagga ctggtggaaa gggagcagcc tgacagagct ccaaatgtgg 360
agaatatggc atccctccac ctatatttga tgtggacggt aaggctaggc ctgcaggatc 420
ccttatcctg accaaagact gtgttggggt gccatttgaa aatcgcaggg ttgcaaaaga 480
atacaatctt acttgcaggt ggatattctc tatactctct tttaatgcat ctaaaaatcc 540
caaacatccc ctggttggtg atcacttaca gttgtgtcca cctttatttt atgtactttg 600
attaaaaaaa aaaaactttt tgttaatata aaaaaaaaaa aaaaaaaaaa aaaaaaa 657



<210> 44 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<400> 44 

ggcacgagtg gaagtgtaag cagaaataca gcgagggctc aggaaatact agaataggca 60
acatgctctt cctctctgct tctatctgca catctgcttt atttctttgc ctcagcagac 120
tcaccatctc tgctcctcat cccgcatggt ggggaaggat gcccacccac acctccccag 180
gccatctgtt agagctccaa ccacgtggaa tgacggaatc cattctgttc tctatctctg 240
ctctagtttc aaattcctgg ggaaaaatga cccagctcac ttcaggctcc cactcttggt 300
ccagtgggct gcaaaatttc caagcgtagc ttctgtcagt tccttgcttt gggttaggtg 360
aaaatgaagg gaataattgt gagctgttca gattcaccaa gaaattatct actattgttg 420
ggggagaatg cccaggggac agatgcattt gggtaaggga caataacaag acactagaaa 480
ggaaaatccc aattttattt tcctacagag tcagcatccc acacattttc cttcacagaa 540
actgacaaat aatccatggg ggcagcttag cagatgggtt gaaaaaagcg acaggctcat 600
catcagtttt caacaccttg atacatcagg cttggccctt gctacctcat gcattattta 660
agcacaatgc atctccctct aattgtgtca tgtgctggag gagaatgtga agttctgtct 720
gtctttagca aacatgtttc aagtactgtc tgtctgaaaa ccaaatggaa gagggtaaac 780

ttgatgatcc acttgatttt agttttagga cctggatgca taggcagatg tcagtttaca 840 

aggattctgt gtactttaag gaatgttttc tgagcatgtc cagtacaaca gacgctctgt 900 

taggtagctg tagttaggat tttttggttg taagtatgtg aagatttaaa tgtatcagct 960 

cacttactca gaaaatctga ggcagtgcta gccaaaccaa atggttcaag caaatgtcat 1020

cagtatttgg cctcttccag tctttttact cctctatcct ctgtgtctgc ttcacttcta 1080 

cacaagcttt ctctatgtgg tggctccaga ttttatatct tctagtagat atttttttaa 1140 

aaaaaaaaaa aaaaa 1155 



<210> 45 
<211> 1112 
<212> DNA 

<213> Homo sapiens 



<400> 45 

gccggaggaa gagcgtctgc aaaactgggt tcctagaagt atagacggac ttagcttttw 60 

gtagaatttg gtgaggagca gcgcctcgtg agagcagaat ggcctggcgt ggccagtgct 120 

tcccggcagc acgcagctct gcggcctcca gaattcccct gttctgagct tgatgcccct 180 

agcctgtccc ctacctactt cctcccctcc tctctagccc tctcacaggg gtgattgcta 240 

cctctctgtt ttcttgggcc taggcaagtt ttagaggagt tcccaagcat tgttatgagg 300 

ccagtgtgct cgctgggctg ggcgggatgg cctgggcttg tgtgtggcct gagggctctc 360

ctggggcctt ctcttttccc agtcaccttt ggagccacag aagcagtgca ctcattggat 420 

gtctgttctt aacacagctt ctctttctac attaaaaaaa atcattattg cattttggaa 480 

agcagtgctc atcaaaagca acttttaaaa cctattttat tgttccttta aatgttctct 540 

cccgctgaaa ctgccctgga gaggctatct gctgctcttc catttaccca catcaggtta 600 

ttctccatgt cactcagtgg agatgactcc agatgtgttt aaagmctgga caattcacct 660 

atactgtgta ggaaattacc tccttaatta cctggtmgaa ttgtcagcag acatgttcat 720

ccgatgatag tactgcagtt ttctattaat aatttgcaga cttttatcta acctgcactc 780 

atgtacagat tattaaaagt tttaaaatgt aactgatcag tattgatcaa tcattgtctt 840 

gatttttttt tacagcgtat atttctaatc atatttttta aagccaagag aactggttga 900 

atgaatgttt attttcctga aggtattttt aagataaagc ttcctaatgg cgtgtaaact 960 

ttgcatatgt atgtagtttg atacatattg tcacatttga aaatcttgtg ggttgtaact 1020 

ggttttatac aaaatatcga atagtggaaa ttgtataatt acaatcatgt aattaaaagt 1080 

attaacccaa aaaaaaaaaa aaaaaaytcg ag 1112 



<210> 46 
<211> 4023 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1049) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2758) 

<223> n equals a,t,g, or c 
<400> 46 

cccacgcgtc cgtccaaaca tcaggaggca ggcagcatgg taaatgagaa agaagccagg 60 
actgggagtc caaagtcctg gcttctatgt ctggctttgc tactaatcaa atatgtgact 120 



ttttgcaaac catacctcac taaaccttac tttcttcatt tgagcgtgtt ggaccagctg 180

tccccaggaa cccccttgga ttgatctgag aaggcaagga taagtttttc aaaggaagaa 240 

aagaggagta gtcagtccgc agtacagtag acacaagccc caggacatct gagtgtcttt 300 

cagcaagaac tctctgtgat atttcactac aatttctctg gcaccttggg actctcctca 360 

gcccttgtgg tggtgggtct tgtttaacta gcagttccct ccattctatg cctgtgaaga 420 

atctatcacc taccatgtga ttacagtgca gatttttttt tccttttcct tttctttttc 480 

tttctttttt tttttttttt tgtttgagac ggagtctcgc tttgtcaccc aggctgcagc 540 

gcagtggcgc gatctcggct cactgcaagc tccgcctccc gggttcaccg ccattctcct 600 

gcctcagcct cccgagtagc tgggactata ggcgcccgcc accgtgcctg gctaattttt 660 

tctattttta gtagagacag ggtttcaccg tgttagccag gatggtctcc atctcctgac 720 

gtggtgatcc gcccgcctcg ggctcccaaa gtgctgggat tacgggcgtg agccactgcg 780 

cccggcctac agtgcggata ttttatgaga gaggagatca caactcagtc cccaagccct 840 

caacccttaa tacatactat cgtatgaaat gcctctttcc aaattcagcc ttttctaaaa 900 

ctcaagatga gaaaactgct gatgaggctc actttctaaa ataccggaat ttgcaatata 960 

gggagaatag tttttcatgt ttctttgttt aagcaataga aagaaaggaa acttatgtcg 1020 

tttacttttc aggccataga ggttttcana acaacttgaa aacatgatca aattagccaa 1080 

acttctgata gttttcaatg tagtctgtga tcatgggata atttagcctc agttcttttt 1140 

ctgaaattgt gttttgaatg tttgatttga cttatttacc atcaaacttg ctataaggtt 1200 

attactctaa tgaataagca tattccctta attgggacaa tttactatta tttctttcat 1260 

aaagtagggc accattcacc atctatttcc tggctcttta gttatcaaaa tgttaagctc 1320 

attgctattc atcccggcac agcacttata tgagaggcat gaagctggct gaattctgca 1380 

tcattaggaa tgacacagcc tcatcacatt gacaccagtg cttgtctctc acaccaatcc 1440 

aaattaagac caactgaaaa tagtcagagt ttcctctgga gctccttttt gaagagacat 1500 

atgtttttta gtctggtggt acccaaaatt gaacaaaaaa tgggtgctgc ttctcttaat 1560 

aggcaaaact atgctgcagg ataatgtatt catgcagggt cttccagcca gaccccaaat 1620 

catccctccc ttcactagaa tttttctgtt taattcgatg gccactctcc acagggatcc 1680 

attctgtgtc ttattacagg agatgctcaa tgaatgaggg acttatcttc tagaaatgca 1740 

gctccgaggt agtctgttga gtgaaataat gaatccatta tcacaaaata aattgaaagc 1800 

tgtctgacat ttggacaatt tttattttgt ttcacattgt tctgaaaact atactgtttc 1860 

ttttctccct attatttaaa taagcaaatg atgaacagat tacaaaattg aggacactcg 1920 

aggtagggaa ggagcccctc gacaggagga tcaggacata gtaccaaggg caagagaaac 1980 

gattcaataa acactattta ctatatattt taggcatggt tctaggtaat cacatgataa 2040 

gtagttgaaa gaactgaaaa tgttttatct gcaagaaaag ggcaagtgta atatcttcaa 2100 

attttagaaa gaatgtaaat tagaatttga cttaatttgg tgtagttctt gtgggcagaa 2160 

attgaattga ataggctgaa agttataaga aggattttag ctcagtattg atactggatt 2220 

gctcatgggt ggtgagagtt actcatcact ggaagagttc aagcaggggc cataagaaat 2280 

ctcagggatt ttataaggtg attcatgctc tgggaaaagg atgccttgga ttattgtgtc 2340 

agggtacttc taactctagg attctggttt ctaagatctg gactctagtc ttgccactca 2400 

cctgccatca agaacatgtt cctcatctgc aggacaggac caagatggct ctgtctacct 2460 

taccgggttg ctgtgaggcg tgattgtgat aaaatacata aaggcagttt ttaagctctg 2520 

aagcactagt taaatgtgta gcgtatttta agattctgtt gtatgtacaa ttgtttagca 2580 

gtctctctct ctttctttct ctttctttta tcagagatag atgattttcc ctcttatttc 2640 

caccagtttg gcttttcagg gaaggtggca gctggcagaa tcccctgaca acaaaaggta 2700 

cagcaaaaaa gtggaggcct aaagaaaaca tgtgctagct ctttagcccc tgaatagnta 2760 

agtcacatgt cagcctgctc tccttcatct gtttgggagg aggcagatta gagtcacact 2820 

gtcatcatgc tctttcccct cagaagcagc tgtaaggttt ttggtagctg tcagtgctag 2880

caaacagtgc ttttctcaca gaactactgg aaagagtcct ggctcggaaa acttgctctt 2940 

gaaagtggca cggccagagc aggggtctct agagggtcgt gccacctcta cctgccacag 3000

gttccattgt cggtcaggta agttagaggc agcagttccc cacctgccct ctggataaca 3060 

gcagcctggg gctgctcctg agtcatgttt ccacttctgt cttacaggcc tcattttcct 3120 

acccatcttt ctgtaaaaat gaaagtcagg agtcttatga aacttaccat tattcaatac 3180 

aggcttttgg tttttttctt taaattagat agggttaggt aagaagtaga gttctataga 3240 

acgttcatag gaagcaacaa aagttgatct cttggtctct acaataggag aggattgggc 3300 

tagatacctt caaagctgac ttgccctaat attctagtat gaaatgattc gaaggtacac 3360
ctgcccctat catgtcaggc agtgagtaca gttaaaacat tgggaattgg taaaggaaag 3420
aaaaaaactg aaaagaaccc tttgaagtta gacaaactgt ccagagacat agtgctaaaa 3480
tcctccytct ttttcttycc acagcttcta gaattcctct ccagagctac tctcaagtta 3540
tatccagggg acaggcccct ttggctccaa cccacacgcc tgaactttaa ggatcattgg 3600
actatcttct ctgtggccar cgcagctctc ttctgtgttc acagaatggc catgataggc 3660
atgctctttt cccacccact ggaaggctca caggcaaggt gagagaggac acagaaggtg 3720
ccaacactgt cgctacagta aggacctgaa gtgactttga gaaattcacc ctcacaaacc 3780
ttccttcagg agcaggcatt ggtagtgcag aggcacagat tccgtccttt accagctgca 3840
gaatcttggg caagttacat agcctctgtg agcctcatcg gtaaacagtg ggggttatga 3900
aacccacctc acagggttgt tgtgaggatc caatgagttg atttaggtaa gcacctagca 3960
catgccgtgg caccaagtaa gcactcaata aatcactcaa ctcctttaaa aaaaaaaaaa 4020
aaa 4023



<210> 47 
<211> 542 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
<400> 47 

agggcacgag tttttttatg actacataat gtttattgcg atctatttta aggcttttca 60 
tggatctttt cagctatgca catggttagt cataatgatt gtcattttag gtcagagttt 120
ctcagcctta gcattgttga cattttggtt aattctttgt tgtaggggct gtcctgtaca 180 
ctgtagggtg tttagcagca tccctgatct ctacctacta aatgccagga gcaacacagt 240 
acctccagct cagttgtgac aactactgaa tgtctccaga caactactga atgtctccag 300 
acattgcaga gaaattgagt ctggttgaga agtcactgtt ttagggcata atttttgggt 360 
agactgttag attctttgtg ttcgttgtnt ctggcctgta taactcttct taattatctg 420 
ctactcaaat gtatttggga tcagccactg tcttccattt ctcttttgct cacagatcta 480 
ctccacagct cttctccctt caaacactgt tcctcagcat cttgtttttt gcagccaaac 540
at 542



<210> 48
<211> 1495
<212> DNA

<213> Homo sapiens
<400> 48

cggcacgagg ctacttatat tttatgaagg acattttttg ttagatgatc tcatcctctg 60
tgttatttgt tgattgggtt tgttttttgc ttgttggttt gtttgtttct tccatgtaag 120
gaaaagtagt gtaagcagta ggaagaaaat gaggaagatg tattttgcat gttcttcctt 180
tcaatgttct tacacattgt attactgcat tgtggtaata gcttctataa aatctgccat 240
agctgggatt atgcagcttt gcaagaatct actagatttt attctaactc atattagctt 300
gtcctatcaa cttctggaat tatctaatta ttgcttttaa aagtttcctg cctttcaacg 360
tttccctgct atgcaaaacc tttcccagac cttggtttct taaaagaaag atgttgctac 420
agttcccaat tctttcttat tacaggctca ggtgtacagg ttattctggc ttaattttat 480
ctaatgaagc ccattccttt ttgtacatga agatgtcact taaacctatg tttacaaact 540
aaagagacta atcactcaat atgaaaacat gaaaacattt ttgcttaaaa tattaagatg 600
gaaatagtta aatatggatt attttgtcct tttacttttt aaaaaaagtt acatattgta 660
tgcactgtgc tgatgcaaga attctacatt ttaatgaatt ataaaattat tctgcatctc 720
atcacgtcac agtatttctg ctctatttat tcatatacat agaaatatat atgggcttaa 780
tcatttaaaa tttgttgcag caagaactct cctacctgta ggcaatagat tgctatgttt 840

tcaacaaatt gtggcaaatt ctaaacagca attcttttgt acgtaatagg acatttcata 900 

ctagaaaaat aaagtaatgt ttttgacatt ggatttggtg cagtttctaa tgaagcaatg 960 

gttggttggt taatatgtct tctgtagctg ttagcattgc caaattaaaa agggtaaatt 1020 

ttatggaaat cctgagacca ggaagatatc aatttcatgt gtacttaatg gtataaagtg 1080 

ttttacagtt tctatcacca tacaaataca taaagacatt ttatagtttt atcaactata 1140 

gagctttagt ctttcaaaag taatttttga aaaacataca ttcctggcca ggtgtggtgg 1200 

gccacgcctg taaccccagc actttgggag gccgagggag gggggatcac ctgaggtcag 1260 

gaatttgaga ccaacctggc caacatggtg aaaccccatc tctactaaaa gtacaaaaat 1320 

tagccaggca tggtggcagg cacctgaaat cccagctact agggaggttg aggcaggaga 1380 

atcacttgaa cctgggaggt ggaggttgca gtgagccgtg atcacgccat tgcactccag 1440 

cctgggggac aagagtgaga cttcatctca aaaaaaaaaa aaaaaaaaaa aaaaa 1495 



<210> 49 
<211> 818 
<212> DNA 
<213> Homo sapiens 

<400> 49 

aaaacttgag tatgttgagg gaaggaatat atatatatct gggagagaat ggatacgttt 60
tgtttttctg aaatggaatt agaaagatgt tcagttgtct tgtgcattct tgcaaacctt 120
gcagttttga gagccctgtt tctgccttgt atcattttcc actgtgtatc kgattctagg 180
agcgtgaaca gggagacaaa ggtgaagttt gtgcacacct ctgtccatgg ggtgggtcat 240
agctttgtgc agtcmgcttt caaggctttt gmccttgttc cycctgaggc tgttcctgaa 300
cagaaagatc cggatcctga gtttccaaca gtgaaatacc cgaatcccga agaggggaaa 360
ggtgtcttgg taacctaatt tttttttaaa ttatgaaatc tgcttttata ttcaaaacta 420
ttactgtcaa gtaaaataca tttttatgtg ttttcattgt gctgaagaaa aactaatttc 480
agcatggaaa tatgtatgtt tggctgggtg cagcgtctca tgtctgtaat cccagcactt 540
tgggagacca aggcaggcag atcacttgag gtcaggtgtt cgagaacagc ctggccaaca 600
tggcaaaacc ctgtctctac taaaaataca aaaattagct gggtgtggtg gtacatgcct 660
gtaatcccag ccacttggga ggctgaggca ctagaattgt ttgaacctga gagatggagg 720
ttgcagtgag ctgagattgc accactgcac tccagcctgg gtgacagggt gacagagcga 780
gactctgtct caaaaaaaaa aaaaaaaaaa aactcgag 818



<210> 50 

<211> 1711 

<212> DNA 

<213> Homo sapiens 

<400> 50 

ggcacgagcg ctcctgtcct gccactgagg gacccggtta ccaaccctca tgtagctcag 60
tttgcccatc tgtcccggtg ctaacacaca gttctcggga gactttcccc attcccagag 120
gagtagtgcg aaatgcgtgt acctctagtc ttaagctggg cgtttgtatt agttgggttt 180
tctggtgtct atttagcaag tgaaagtttc tggttccctc cttcactgtg tgacctgact 240
agtcctcctg gattgcattt atggaagttt atacgagacc tagtttccat ggaggaactc 300
actgattccg cgagggagat ggggtactgg atgatggtct tcagccttaa ggctatgttt 360
ccagtgtcct ctgggtgttt ccaagagcgg caagaaacga ataaatctct gacccttctc 420
aggtgcagcc agagagacac tagcccactg atggacggac agacgtgggc aagggtccgt 480
gtcactaaac cacccaccac tgccacagct gcctacaaca gacacatcag atgacactcc 540
gggcaaataa atgattttca ctgaggactt actggtttta ataataggtc ctggtgtaga 600
gaagtccctc aacctattgt gcaacgagtt ttgagaagcg ggtaagctgt atgttttgtg 660
gttttgtttc ataaattcat ctacaggaag accaatattg actgaatgaa gctttcattt 720
aaagagctaa aatatgcttt gtgtttttat atgtggatac tactttaaac ctaacgacta 780
ttcattgtat catagcttgt gatgtattct gctcatggct tttaaggtaa attgtgccat 840

gatccactgc cattctaatt gctttaacaa gtcattacca cactactgtt acatcttaat 900 

tatgcataca gacaggtaga cttgttttac atatgtgaac taactagttg tcaaagcaaa 960 

tgcagattgt attctgcaag taaagtcttt ttctctctga aatttctagg gatgttcttt 1020 

aagtgaaatt catattaaaa ctgaagattt tagttacaag aactgagtgc agattaagtc 1080 

tttgtgattc aacatagtca agatacaact gtggatattt catggaagta tgcaataaaa 1140 

tgtctctacc tggaaaaatc tatcaagcag cgtcacagta ctgaatttga aaccagaaat 1200 

actgggtttt tatataaatg cttcatagat ttgttttatg ataaagggca cataactctc 1260 

ctaaacctca caccacctct tgaataggta taataagtcc acatcaatgc tgatgcctta 1320 

gctattatta aactcttaca gtatgatgta aagtgaaagt acaatgtaag atcattccta 1380 

ggccaacttt gaccagtttt atacagaaac atgtgccaac ttttctgttt gcaaggataa 1440 

tatcaaagca aacaccagaa agttatatct ttgatgcatt ttttcaaaat catacacata 1500 

atacacaaac caaagacaaa tgatgaatat tacgtcagaa aatataaagt cttccccttt 1560 

cttcttttgc caagaaagtc caatattttc accattttta tgcacacaat caactttatt 1620 

taagctggaa gttaatgtct cattgttttc attgttctaa ataaacacct tttcccttga 1680 

gtattgctct aaaaaaaaaa aaaaaaaaaa a 1711 



<210> 51 

<211> 749 

<212> DNA 

<213> Homo sapiens 



<400> 51 

gccaaaccag rtaataattt ccttataata catgaagtcg ttattttgca tttattttct 60 

taggtggcca atggggttat cttgggggga gacttttata ctcctaaggg acagcttggc 120 

cattaacttt caaagtttct ctaaagcagc gtcaggagat atatttggtt gtcatgacta 180 

gtggcattcc actgacatgt aatgggtaga ggctgggtag acatcctacg atgcacaaga 240 

cagcctccca caataaagaa ctgtgtggcc caaaaatatc agtgatgctg agattgagaa 300 

acttaaagaa atttaaaaat taactctata caaaatctaa tgtttgagtt ttctccatgt 360 

atctgtgact gcaatgacca gagtgactgt ccataaagaa agtgctaaga gttggctggg 420 

tgcggtggcc tacacctgta atcccagcac tttgggaggc caaggtgggt ggatcacctg 480 

aggtcaggag ttcgagacca gcctggccaa catggcaaaa ccccatctct actaaaaaat 540 

acaaaaatta gctgggtgtg gtggcacgca cctgtagtca cagctactca ggaggttgag 600 

gcaggagagg tgcttgaacc cgggagatgg aggttgcagt gagccgagat tatgccattg 660 

cactccagcc tgggtgacag agtgagacaa aaaaaaaaaa aaaaaactcg agggggggcc 720 

cggtacccaa ttcgccctat aggcagtcc 749 



<210> 52 
<211> 1091 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1079) 

<223> n equals a,t,g, or c 



<400> 52 

ggccagtggg cagggtcaca gggcaaggtc ccgcgggccg ctgggtgcgg cgacttccgt 60 

gctcccggcg agcgggcgga gagcgggggc cgcactgggg agtgtgggct gggccgcaga 120 

tgtcatgtgg cctgtktttt ggaccgtggt tcgtacctat gctccttatg tcacattccc 180 

tgttgccttc gtggtcgggg ctgtgggtta ccacctggaa tggttcatca ggggaaagga 240 

cccccagccc gtggaggagg aaaagagcat ctcagagcgc cgggaggatc gcaagctgga 300 
tgagcttcta ggcaaggacc acacgcaggt ggtgagcctt aaggacaagc tagaatttgc 360

cccgaaagct gtgctgaaca gaaaccgccc agagaagaat taatggagga cacagggccc 420 

tatggtccta ctgtgggtgg tgacttgtcc tgctaccatg ttgacagagc cccagaaccc 480 

acatctaatt ggctttgttg cttattctgg cccttcccac accacacagc cacacaaata 540 

ctggctgctc cttgatggcc aggcagaccc agcagcagcc gaggggccag tgaagaggaa 600 

ggccgcatct gttgtgtggt ggccacaagc actcaggcat ctgagtttac tggtgcactg 660 

ctgggaggag agttatgaga tgaacattgg ctgtcaatct ctgtgggcag gcggtttggc 720 

ctctagtggg aatggctggg atttgggcgt tgcctttagg agggatacct gcatgtctag 780

ttccagtctg cactggaaag aattcaaata tgcacctggc tcccttcact attttgccct 840 

atcctttgtg ctcattctta ctgaaatctg tcttgtcagc tcaggaatgg gattccccca 900 

ggaaggaaag cacttttctg ttctgggaag cccagactgt tcactttggg gcagggacga 960 

acatgtgcct cgtgaatttg cttgaaaaca gtcaccatct tctaccccca tcactgtata 1020 

gtgaaaaacc tgattaaagt ggtatctgag aaccawaaaa aaaaaaaaaa aaaactcgng 1080 

ggggggcccg g 1091 



<210> 53 
<211> 2254 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1182) 

<223> n equals a,t,g, or c 
<400> 53 

ggcacgaggc ccgctgcaat gttatcatca cccaacctcg ccgcatctct gctgtgtctg 60
tggcacagcg ggtcagccac gaactgggcc cctccctgcg ccggaatgtg ggcttccagg 120
tgcggttgga aagtaagccc ccatcccgag gcggggccct gctcttctgc actgtgggta 180
tcctgctgcg taastgcaga gcaaccccag cctggagggc gtgagccacg tcatcgtgga 240
tgaggtgcat gagcgggacg tgaacacaga ctttctgctg atcctgctca agggcctgca 300
gcggytcaac ccggccctgc ggctggtgct catgaktgcc acaggggaca atgagcgctt 360
ctcccgatac tttggtggct gccccgtcat caaggtgcct ggcttcatgt acccagtcaa 420
ggagcactac ctagaggaca tcctggccaa gttgggcaag caccagtacc tgcaccggca 480
ccggcaccat gagtctgagg atgaatgcgc actcgatttg gaccttgtga ctgatctggt 540
tctgcacatc gatgctcgcg gggaaccagg tgggatcctg tgcttcctgc ctgggtggca 600
gagatcaaag gagtgcagca gcgcctccag gaggccctgg gcatgcacga gagcaagtac 660
ctcatcctgc cagtgcactc caacatcccc atgatggatc agaaggccat attccagcag 720
cctccagttg gggtgcgcaa gattgtcttg gccaccaaca ttgctgagac ttccatcaca 780
atcaatgaca tcgtgcatgt ggtggacagt gggctgcaca aggaagaacg ctatgacctg 840
aagaccaagg tgtcctgcct ggagacagtg tgggtatcaa gagccaatgt gatccagcgc 900
cggggccggg cgggccgctg ccagtccggc tttgcctacc acttgttccc tcgaagccgg 960
ctggagaaaa tggtcccttt ccaagtgcca gagatcctgc gcacacctct tgagaacctg 1020

gtgctgcaag cgaaaatcca catgcctgag aagacggcgg tggagttcct gtccaaggct 1080 

gtggacagtc caaacatcaa ggcagtggac gaggctgtga tcttgctcca ggagatcggg 1140 

gtgctggacc agcgggagta cctgactacc ctggggcagc gnctggctca catctccacc 1200 

gacccccggt tggccaaggc cattgtgttg gctgccatct tccgttgcct gcacccacta 1260 

ctggtggtcg tttcctgcct cacccgggac cccttcagca gcagcctaca gaaccgggca 1320 

gaggtggaca aggtgaaagc actgttgagc catgacagcg gcagtgacca cctggccttt 1380 

gtgcgggctg tcgccggctg ggaggaggtg ctgcgttggc aggaccgcag ctcccgggag 1440 

aattacctgg aggaaaacct gctgtacgca cccagcctgc gcttcatcca cggactcatc 1500 

aagcagttct cagagaacat ttatgaggcc ttcctggtgg ggaagccctc ggactgcacc 1560 

ctggcctccg cccagtgcaa cgagtacagt gaggaggagg agctggtgaa gggcgtgctg 1620 

atggccggcc tctaccccaa cctcatccag gtgaggcagg gcaaggtcac ccggcagggg 1680 
aagttcaagc ccaacagcgt cacatatagg accaaatcag gcaacatcct gctgcacaag 1740

tcgaccatta acagggaggc cacacggtta cggagccgat ggctgacgta tttcatggca 1800 

gtcaagtcca atggcagcgt cttcgtccgg gactcctctc aggtgcaccc gctagctgtg 1860 

ctgctgctga ccgacgggga cgtgcacatc cgtgatgacg ggcgccgggc caccatctca 1920 

ctgagcgaca gtgacctgct gcggctggag ggtgactcgc gtaccgtgcg gctgctgaag 1980 

gagctgcggc gggccctggg ccgcatggtg gagcggagcc tgcgcagcga gctggctgca 2040 

cttcccccca gcgtacagga ggagcacggg cagctgcttg cgctactggc agagctgctg 2100 

cgaggaccct gtggcagctt tgatgtgcgc aagacagctg acgactgagc cctgcttctg 2160 

ctggggctgt gtacagagtg caaatgttta tttaaaataa agttctattt atcccttgtg 2220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaact cgag 2254 



<210> 54 
<211> 486 
<212> DNA 

<213> Homo sapiens 
<400> 54 

cacactgaca tctccccaac aggtgagggc agggagagct ccagacaggg agaggccttc 60 
agagaacagg aaggaagctc cctccctcct ctgcattttg cagcctgtag ctcacgtgcc 120 
ttttatgccc cacatctcat tctgtctggg gactccatac gtagtggctg tctaccttcc 180 
cgcgtggatt gtaatgcttt tgctaccagg ggtcaggcca tactcatcac tgcaggccct 240

gaagcatcca tcatgttcct cgagctcagt atgtgctccg tacatgtagc acagtggaaa 300

aacttgagct ttgctggcaa agacagacag aatgagcttg aatctcagcc cagctatggc 360

ttttctagtc ctgtggctag aaaatgactt agcctcttgg actttggtta acccatctgc 420 
aaaacaggga tggcacccac ctcttagaaa gttacagtgg tcaaaaaaaa aaaaaaaaaa 480 
ctcgag 486 



<210> 55 

<211> 1270 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gaaaccatcc aagataagag acatgggagt gaaattcaca cccactctgg ctttcatacc 60 

atgggtctga acattagccc atggtgtttc ttggccatac tgacctgtgc catttcagct 120 

gcattcatct cagttggtgt tgtctgctgg ctgctctttc tgatttccca caggagcagt 180 

aagaacctga ggaagagtag ggtcagagga gtctgggaga atgaggaaat atgagagccc 240 

caggaactga aaaggcctgt gagagactct gagcttcctg ggaacaggta taggttcttt 300 

ttatttcaat aataacagaa acaactgtca aaaccatgtg cctgtactat ttggagtgct 360 

gtccttgcag aatctcatta taagaacctt aggaaatagg cacatcatct cctggataga 420 

atcctaggaa atgggcacta taatgggcac tttatcccat tttataaaca tggaaattga 480 

ggcacagaga gattaagtac tttcccaagg tcatacagct agtgatggag gagctagcat 540 

ttgaacccsg gagtttttag tctattgagt ttaaccgaca gatcatactg tgttttggta 600 

gggaggragg gtgaagcaag caartgaaca aatgartctg ggatttarga cttgccagac 660

aaacaaggcc caagaggcaa gtgtgcaggt gggtgtagtt gggagtcagc agagttgggt 720

tggaattcaa gctttgccac ctgctggcta taaaccttgg ttgggtaagt aacccaaggt 780 

aaatgagatc atctctgtaa aactcttagc cttgtgcctg gcacatagta aatgcttaat 840 

aagggttcac tgttagtatt actgttactg ataacataca aatagattgt attaatggac 900 

cataattgca actgtataaa acaaattcca tgtttggcca ggcgcagtgg ctcaagcctg 960 

taatcccatc acttcgggag gccgaggtgg gcagatcacg aggtcaggag atcaagacca 1020

tcctggctaa cacagtgaaa ccccatctct actaaaaata caaaaaaatt agccaggcat 1080 

ggtggcgggt gcctgtattc ccagctactt gggaggctga ggcaggagaa tggcgcgaac 1140 

ccagggggcg gagcttgcag tgagctgtaa ttgtgccact gcactccagc ctgggcgaca 1200 
gagcgagact ccgtcycaaa aaaaaaaaaa aaaaaactcg agggggggcc cgaacccaat 1260
cgccctatag 1270 



<210> 56 

<211> 2059 

<212> DNA 

<213> Homo sapiens 

<400> 56 

ggcacgagcc tcactgggta aacacaagct ctcggcaggg aacaagctct cggcagggaa 60 

tgcacagaga agcactaagt caaagtcatc tccgtgtggt cattaaaatt ccagtgaatg 120 

aaaaagtttt agggaacatt aactgttaat ggcttatgta ttagctgttc tctgttttaa 180 

aagcctttgg gcacttttca aacctaacaa acaactgatt gaatttctat tgatggtcaa 240 

agtggtaaaa atacctcttt gttatctaag acaactttta ggtggcatta agaccccgag 300 

ggtctagggg cctccaggtg tacggctttg aagtaagggc aagaagcatg ggtgccctct 360 

aggagcactg ggagggacag tggaggaaga ggccccgccc gggtgccact gctgaggcag 420 

ctcttagggc cctgccactt actttgctca gagcaaaagg tgccgtcagc tgggtcctac 480 

ccccgcctgt gcggctggat ctgggctgcg tcagcaccgg gacccgcccc agcagcacat 540 

ctgtcaaagc ggaaaacagt tcacacacca gagcctcatg tggaaagagt ccaggcgctc 600 

agacttttct ccttcatgac cctcatctct gctctcgggc ctcagcccgc ctgtcgtact 660 

gcaggaaatt ttgtttcctg cttgtcttca aagacttcag ggcttagaac ggaccataga 720 

aagatatgaa ccgagagccg tggcagctgt ccagcttgca ggctgatatt tgtgtagatg 780 

ccatgcagat aacacagcag ggtccaggcc cttccctctg gaactcacac tcggctgtat 840 

tttgggagga accgctgggc aggttctgcc caggaacagt taactctgca gagcacagtt 900 

tccacatctg ggctgagttc ccaaagaaag tgcgcctaac aacgtcttgt ggaaggcggt 960 

tggcgtcaag agaagtaaag tgttcagatg gcagtgattt taatgaacct actagctatt 1020 

ttaataggaa agattttttt ttttatccgt ttggttttta tttttagaat catgaaatag 1080 

caagttggct ctgctgaagt agaaatggtg ggcggggagt cgccactgac catcgtctgc 1140 

gcagagcacc ttcctgctga ccacagctgt gagcgccggc tgtacgcagg tccctgctgt 1200 

gcgggtgccc aagagggctg agggctgcgt ctgccatggt gtctccacct tggacaccat 1260 

gaatcatgag aggttctcag ggctgcctcc cacaggcttt ctgtgtctta cctgggacac 1320 

tcgggactag ttgtgtttag gttttcttaa aattctgtag taattgcatt gtagagcatc 1380 

cctaatccga acttcagaaa tccaaaatgc tccagtgagt atttcatttg agcatcatgt 1440 

cagtgctcag atatggtcag accctggagc actttgcatt ctgaagtgaa ggatgctcag 1500 

cctgcgtctg acagaagctc cagagaagtg gctgcgagtt cagggcaaga ggctcctggc 1560 

tttgaggctg caccgtttct ggaagtcaag ccccacagtg gacctcgagt ccctctgtga 1620 

ataggaatgt gctcgggcag ggaacccgga gcaccagccg ctgggcccct cgttccccgg 1680 

ctctgcagcg tctttcgggt cctgtgggtc tggctgtgcc cagccctgct gcccgtggac 1740 

tatagtgtct tggggtcctg caggcttggc tgtacccggc cccagtgcat ggtggcaagt 1800 

gcctgtgatc ccagctactc aggaggctga ggcaggagaa tcacttgaac ctgggaggca 1860 

gaggttgcag tgagcggaga ttgcgccatt tcactccagc ctgggcagca agagcgagac 1920 

tttgtctcaa aaaaaaagga acgtgcctca ttcagtggtt cgatggtggt tctgatcaat 1980 

gcccagaagt tctgatggag cttctgtcag acacaggctg agtatccttc accccaaaat 2040 

taaaaaaaaa aaaaaaaaa 2059 



<210> 57 

<211> 868 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gactgactat agggaaagct ggtacgcctg caggtaccgg tccggaattc cgggtcgacc 60
cacgcgtccg ctgaatttag gagacttttt acccaggggc aaaaggctct tagggtaatg 120
agatggatgg tggcccaggt gcattttcca gggcctgggt tctccagatc ccgtggcttc 180

tgttgagtgg aggcaacttt gctctgtgtg aacctcgccc ctgtccctct gccgggcacc 240 

cctggcagga agcaggactc ccatcctcac cctgacttag actgtcctct gagtcagctc 300 

ctctccaaga caggagtggg cagccctggg cagtcttctg gccccttgct aaagtgaggg 360 

scaggaagct ggggctgccc tccagaaagc cggggtaggr actctgaaaa atacctcctc 420

taaacggaag cagggytctc cagttccact tggcgccccc tcccacaagg cccttcctcc 480 

ctgaggaccc caccccccta ccccttcccc agcagccttt ggaccctcac ctctctccgg 540 

tgtccgtggg tcctcagccc agggtgagct gcagtcaggc gggatgggac gggcaggcca 600 

gaggtcagcc agctcctagc agagaagagc cagccagacc ccaaccctgt ctcttgtcca 660 

tgccctttgt gatttcagtc ttggtagact tgtatttgga gttttgtgct tcaaagtttt 720 

tgtttttgtt tgtttggttt ttgttttgag ggggtggggg gggatacaga gcagctgatc 780 

aatttgtatt tatttatttt aacattttac taaataaagc caaataaagc ctcaaaaaaa 840 

aaaaaaaaaa aaaaaaaagg gcggccgc 868 



<210> 58 

<211> 986 

<212> DNA 

<213> Homo sapiens 



<220>

<221> SITE
<222> (592)
<223> n equals a,t,g, or c

<220>

<221> SITE
<222> (669)
<223> n equals a,t,g, or c

<220>

<221> SITE
<222> (767)
<223> n equals a,t,g, or c

<400> 58 

gaaattaagt catttagata aaaatatgcc attcttatct gtttggtttt ttaatcttgg 60 

cttaatattt ggggttgagt catttgtttt gagagctgtc ctgtttattg cagggtgttc 120 

agcaacatcc cagatggaag cagcatcccc ctacccagct gtgacaaaaa gaaaaaaaaa 180 

tgtctccaka cactgccaaa tatcttctgg gggtgcccct ggttgagaac cactgcttta 240 

gtggataaac tttaggcagg agggaaatga tcgcagttgg atagttggag gaatgtggag 300 

caagggaagc aataaactgt gaccataaaa acatagaaag atggcttata tgtggatttt 360 

tttttaaagc acgtagaatt gcttaaaatg gacaacagca gcatataaat cagtggcaga 420 

gttggtggct gaatttagag catcttaagt ctatgttctc ctggaacaga gtgcagataa 480 

ttcagttatc agcttggcta ggtgcatgtt gaagtattta gtcacacaca aacagttaat 540 

gtatggggaa gataacttct atactagtag gagagaaatg gaacaagaat wntaaataca 600 

ctatcaaaat atgcaagaat ggcaagmgga aaaggcagaa caagctgcaa aacmcacaca 660 

caattagana taaatatttt gggacacaat aaatgtgaat ggattaaaac ctctgttaat 720 

gacaaagttc tccagttaaa ggaaggcaaa tagtgttatt aggaatngat tactatatga 780 

tgattaaagg ctcagttcaa caggaagatg attgatagaa ctttcctaca tttgtaacac 840 

agtcttagaa gatattaaag caaacattca agaagaaatt gatcacctac taccatagtg 900 

tattttattg aattggtaca tttcaataaa gtgtcataag gcacggttga aggaaaaaaa 960 

aaaaaaaaaa aaaaaagggc ggccgc 986 






<210> 59 
<211> 695 
<212> DNA 
<213> Homo sapiens 

<400> 59 

ttttttttct tgaaataaaa tgggggagta atgggaaata atttttttga gcccttgcgt 60
ttctaaaaat gtttgcattg tgccttcatg tttgacagtt cagttccagg ttgaaaatta 120
tttttctttg gaatgttaac agctgccctc tattttctgt ttttatctaa tgttgctgaa 180
gagaaatctt ctgattctta ttcttttttt ggtgacctgt tttaattttg tgtctttctt 240
ttttttcccc tggaagcttt taggctctcc cttttatcct tgtagtctga gatctgacaa 300
tgatggttgt gtctagtttt ccccaggata ctttttcatt tgtcctggtc agcactagta 360
gatcctttca gtttgtgtac ttctgtttct cttgctctgg agaatttaaa aaatatatat 420
atttttgaga cagagtctca ctctgtcacc caggttagag ggcagtggtg tagtctctac 480
tcactgcaac ttctgcctcc tgtgtttaag cgattctcct gtctcagcca cctgagtagc 540
tgggattaca ggtgcctgcc accacgccca gctaattttt ttgtattttt agtagaggca 600
gggtttcacc acgttgccca ggctggtgtc gaactcctga cctcagatgg tccacctgcc 660
ttggtctccc aaaaaaaaaa aaaaagggcg gccgc 695



<210> 60 

<211> 314 

<212> DNA 

<213> Homo sapiens 



<400> 60 

gtcgacccac gcgtccgctt tgaggagcat tcctctagat tgcacaaggg acagtgcctt 60
taaccaagcg aggagtccaa agctcaggac ctgactaccc tgagggcacg ctgacgcctc 120
ttcccagggg gatggggagc tttctgcacc cccagtggca tctcctcatc acgttctgtg 180
ccgtccttgg gaaaggcctg cattctgatc cttccaggcc cttcgagcat ggaggggcac 240
tggggaaggt cccccgaggg aggagcacgt tgctgagtaa agaggtgtta ctcaaaaaaa 300
aaaaaaaaaa aagg 314



<210> 61
<211> 734
<212> DNA
<213> Homo sapiens

<400> 61

gactgcttat atttggcatt gtcttttccc tggcactgcc actgtcacca ccatccccct 60
tctggatccc tactttaccc cttcatgctg ctctggtggc agtgcctctg ctgccatgct 120

gtacttgagc ctgctgctac agccatgcct gaagatgcag ccccttcctc tcttcctgtc 180 

ccaccaaata tgaccagctc taggttccat tacttctgga ctttgctcca aataaaactt 240 

acacaatttt attccaaacc caggtctctt tctgcaacac ccgagaaaaa tattgggctg 300 

caggagccag agaggagaga gagatttact ggtgagagtt gtaggtggga attgaaagcc 360 

aagtcatgtc tttgccccac cagaaactca ctaggatgta cacaatgcca ctgtgatggt 420 

kttaaaatat gtaactaacc tgcacgttgt gcacatgtac cctaaaactt caagtatata 480 

taaararaga aagaactgst gatacacata tcatgaaaaa agaccaaata aaataaaaaa 540 

ataaaaataa ataaataaaa taaaatatgt ccacaaatgc tttgatgttc ctttgtttct 600 

tgatctgtat gctagcaaca caggttcatt ccgtttgtga aaattcattg agctgtgctc 660

ttatgagctg tgtacttctc tacatgtatg ttaaatgtgg acaagaactt cacataaaaa 720 

tcattttaaa aaaa 734 
<210> 62
<211> 1410 
<212> DNA 

<213> Homo sapiens 



<400> 62 

ccgcctcctt gccgcccagc cggtccaggc ctctggcgaa catggcgctt gtcccctgcc 60 

aggtgctgcg gatggcaatc ctgctgtctt actgctctat cctgtgtaac tacaaggcca 120 

tcgaaatgcc ctcacaccag acctacggag ggagctggaa attcctgacg ttcattgatc 180 

tggttatcca ggctgtcttt tttggcatct gtgtgctgac tgatctttcc agtcttctga 240 

ctcgaggaag tgggaaccag gagcaagaga ggcagctcaa gaagctcatc tctctccggg 300 

actggatgtt agctgtgttg gcctttcctg ttggggtttt tgttgtagca gtgttctgga 360

tcatttatgc ctatgacaga gagatgatat acccgaagct gctggataat tttatcccag 420 

ggtggctgaa tcacggaatg cacacgacgg ttctgccctt tatattaatc gagatgagga 480 

catcgcacca tcagtatccc agcaggagca gcggacttac cgccatatgt accttctctg 540 

ttggctatat attatgggtg tgctgggtgc atcatgtaac tggcatgtgg gtgtaccctt 600 

tcctggaaca cattggccca ggagccagaa tcatcttctt tgggtctaca accatcttaa 660 

tgaacttcct gtacctgctg ggagaagttc tgaacaacta tatctgggat acacagaaaa 720 

gtatggaaga agagaaagaa aagcctaaat tggaatgaga tccaagtcta aacgcaagag 780 

ctagattgag ccgccattga agactccttc ccctcgggca ttggcagtgg gggagaaaag 840 

gcttcaaagg aacttggtgg catcagcacc cccctccccc aatgaggaca ccttttatat 900 

ataaatatgt ataaacatag aatacagttg tttccaaaag aactcaccct cactgtgtgt 960 

taaagaattc ttcccaaagt cattactgat aataacattt tttccttttc tagttttaaa 1020 

accagaattg gaccttggat ttttattttg gcaattgtaa ctccatctaa tcaagaaaga 1080 

ataaaagttt attgcacttc tttttgagaa mtatgttaaa gtcaaagggg catatataga 1140 

gtaaggcttt tgtgtattta atcctaaagg tggctgtaat catgaaccta ggccaccatg 1200 

gggacctgag agggaagggg acagatgttt ctcattgcat aatgtcacag ttgcctcaaa 1260 

tgagcaccat ttgtaataat gatgtcaatt tcatgaaaag cctgagtgta ttgcatctct 1320 

tgatttaatc atgtgaaact tttcctagat gcaaatgctg actaataaag acaaagccac 1380 

cctgaaaaaa aaaaaaaaaa gggcggccgc 1410 



<210> 63 
<211> 1231 
<212> DNA 

<213> Homo sapiens 



<400> 63 

ggcacgagtg aatgtcgagg agttccagga tctctggcct cagttgtcct tggttattga 60 

tgggggacaa attggggatg gccagagccc cgagtgtcgc cttggctcaa ctgtggttga 120 

tttgtctgtg cccggaaagt ttggcatcat tcgtccaggc tgtgccctgg aaagtactac 180 

agccatcctc caacagaagt acggactgct cccctcacat gcgtcctacc tgtgaaactc 240 

tgggaagcag gaaggcccaa gacctggtgc tggatactat gtgtctgtcc actgacgact 300 

gtcaaggcct catttgcaga ggccaccgga gctagggcac tagcctgact tttaaggcag 360

tgtgtctttc tgagcactgt agaccaagcc cttggagctg ctggtttagc cttgcacctg 420 

gggaaaggat gtatttattt gtattttcat atatcagcca aaagctgaat ggaaaagtta 480 

agaacattcc taggtggcct tattctaata agtttcttct gtctgttttg tttttcaatt 540 

gaaaagtaat taaataacag attagaatct agtgagagcc tcctctctgg tgggtggtgg 600 

catttaaggt caaaccagcc agaagtgctg gtgctgttta aaaagtctca ggtggctgcg 660 

tgtggtggct catgcctgta atcccaacat tctgggaggc ccaggcggga gaactgcttg 720 

agccccagga gttcagaatc agcctgggca acatagcaat actccgtctc ataaaaatta 780 

ataaataaaa agtctcaggt gaccaaaggc tcctgaagct agaaccaggt ttggataaag 840 

attgaagagc cacaggccac tcttccctct gagccattgg gcctagtggt gtcatgtatt 900 

gtaattgctc gcagggagag cagtcttttt ggtgtaatag tgggatgtct gcttagttgg 960 

caggggttca gtccaaatgg aagaatattg ggaaataaac ctccactatc ctttatagcc 1020
agggactttt ttcctattta ttcataaaat aaattatagt taattatacc cataacacct 1080

ttatttaaat ccagtgttct ccgcagcctt ttgtctattt atatgtgtac caagtgttaa 1140 

acataattat tattgggcat ttgaactttg tttttcttta aagaaatgct gctattaaac 1200 

atatttgtaa atggaaaaaa aaaaaaaaaa a 1231 



<210> 64 
<211> 612 
<212> DNA 
<213> Homo sapiens 

<400> 64 

ggtcgaccca cgcgtccgag catttgtctg tataatttta gttattgaat taaaatcttt 60
tgggacccca acaggatgag atcattggcc agctggcttc ctcccacctg cacctggact 120
gaaattcccc gtggcattag aggtgtttcg taaggtgctc cctgctgtct gtcctacaga 180
ttgcagtggc tctgctggaa aagaacggaa ttctatgcaa gttgcgtgtg tcatgaaggt 240
ctctgcacag tgggtgtgtt tctttgtcgt cttttctcca ctctgctctt ctgtgaaatg 300
tgccagcagt ggacagaaca ggggcagagg tgatcagtga ccattgcaca gaatatcagt 360
aagtgttgta aggtatatag tcttggccaa caaattgtaa gcaaaatacc aggaacttcc 420
taatctagta ggaaattttg tatgcttttg acaaacatct gatcctactg acactgaaag 480
tccttagaag gagaattgct tgaacccgga aggtggcggt tgcagtgagc caagatggcg 540
ctactgcact ccagcctggg caataggaat gaaactccgt caccaaaaaa aaaaaaaaaa 600
aagggcggcc gc 612



<210> 65 
<211> 2270 
<212> DNA 

<213> Homo sapiens 
<400> 65 

tttttttttt aactttttaa acaatccatt ttaatcatct aaattattta caatacaata 60
acatggattc atccttttta agacatggga ttgtaaaaat caacaagtga atgatgcttc 120
aaataataca tttaaataca ttaatcaaat tttttcagtg cttaaaactt tttctccatg 180
ggacagcagg ctctggacaa aagtgcctag catacaagtt ttcccaattt ccttctatca 240
taccagctgc acataaaaag gttcatcacc tcctgtctcc aaagtgtctc cctactgagt 300
gttcccaggc agacaatagt tcctgggata gtgctgtttg gtaacagaaa agcccaagcg 360
tagaggacgg attaaaaggc agggaccaga ccgccatgga tacaaatccc aagacagagg 420
atgccccatg ccttccccat gaagcttatc tgtctgcctg tgtctccatg attgcaggca 480
tagagctact tgggacctcc aggatgattt acttagcgat atgcttttta cattctaaga 540
atcaaaatgg tcctgtaatt cccaatagag aaaatagagc caattcattg ttctcccctc 600
tcccctctga agccagtttt taaagatgag ccttacccag aaaataagcc ccaaagaact 660
ctcatctaaa tgatcagacc cttcctaaat tacctttggc aacctaggta attctttttt 720
attacacacc tccaacctga ccctttctac agtttcaact ataaatgttc atgcccctcr 780
tcaaataacg ttgctaggat gaatttgcca caggtttgag tacagagaga acaagcaaga 840
aaaatgtcag tgtttatttt aaggagagtg gccaggatgt cagtcctcat aattggtccc 900
ttctctctct ctatcctcca aggtaagttc tttgttgact tgataagctt tagtccttct 960
gtacaacttc tagaagatgc acttaatggt gcttctttgc acttccagaa ctcaccttct 1020
attctacctg taaggctgta ggggagcatc ccaatcaaca taaggcctac ccctttagcc 1080
acgaaaatca gccaggcatc atgtttctgc accaccacct gccttcctga cggacactgg 1140
tgctgatgac aaaaatggga cagtaccgca gctggtttct ctttttcgag tgtgtagata 1200
agaaataaaa aacattttca ttccctcaca agcttaatct agtaatataa ctgcctaaaa 1260
aaaatcaaac cataaataaa cctatgtgct aaacaaatca catgacttga tgacttctct 1320
aaaattaatg tcaaggaaaa aaggaaaagt tgatcccaag taaaatccct tgaccacagc 1380
tgtctgaaat tagccagggg aatgggagac accaccaaga acctcagctc tttcctgccc 1440
tgtatttcaa ggggagtgtt gtggccttca caaatgaaaa ttatgaatca caaagataaa 1500
cgtcctcact tctaacctgg tgaatcctca ggaatgtcat gaggatgaca acacagggtt 1560
aattcatttt ttctcagtct cccccctgac tccacaaaag ctttgccttc ccaacacaag 1620
gggctgggag gtccagtcta gacagagcat gctgttgggg taaacagtaa ccatgtgatc 1680
ccatgattcc cagagctctg agcacaaagc ttttcatccc agtggcaact ggaatgtggg 1740
taattctgta aactcatggc cacaccttta atgcttgggg acagtgggtg gagtcagcca 1800
gagctctttt ccaacttcat ctagggtctt ctctctggaa aagcttagtg acgttctccg 1860
aaggtttatt tggttaagga gtattgctaa aacacttttt aaaaatccac tttgaacaca 1920
tgtgtaagct gaaaagaaaa tgacatatat acctccattg aagctgggaa agtgaaaagg 1980
ctgacgaaat gtctgaaatc ctgagccttt cctggttcta ttttaataca gcgtacaggt 2040
aacagatgat ctcatttacc ttctgaatga cccagcactc aatttcccta aaactgctca 2100
gctccacttg gaaatcacca ggggacttga gaatcttccc cttagactca gggagacacc 2160
cagaccagga agaagggcac tgatgttttc agggacccaa aagcccactt tttttttttt 2220
tttttttttt ggaattcgat atcaagctta tcgataccgt cgacctcgag 2270



<210> 66 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<400> 66 

ggcacgagcg agggaacaga ttcccagggt ggggtggggt agggctgggc ctttgctcct 60 

ttgtctcctt tcccaaggca aagtgaagag aagagagtga ctccttcttc actcagggaa 120 

cccaggcagg gatgaagcgc ctgtggtgtc tgagctgggt cccggggctg caggggagcc 180 

cctcagtgtt gtcctctgta ttcttctccg tgttcaaacc acagctgcat tggacatgca 240 

gtcaggtgtc ctctcactgg caccctccct gccttttcat tcttttttct ggatagtctt 300 

tcatcaagtt ttctctgcct tcaccttgct cttcctgaat cagttcacct tgaggggggt 360 

taacagagca ccttggcagg ctctgttcct ccaggtccca ggccagcccc cgggactcag 420 

ggcctgcctt cccctcacct tcttgagcag cacaaactcg ttctctgctg ctgtccgctt 480 

gttgatttcc tcctcatacc tgtgggaatg gcgagggctc attggttaat atctcactga 540 

aagcccttgc tttcacaggg cagcgttgag caaggagcag cgtgtccatc agaagataca 600 

ggtgctgggg gccgcagagt actgggcagg ggtaagtggg ggaaggcttc ctggaggagg 660 

gaacatgcta accagtttgg aagaatgaga ctgttaaaga tccagcttgg caaacgagga 720

aggagcacat ggagcgaatg ccctgtggga ctctcagagg aaccaggatg tgaactgccc 780 

tccccaaatt tgagtacagc tttacaaatg acaaagcgct ccaatctgca tttcctcagt 840 

tacccttgag agcagtcttg gagccagatg cacttaaccc tcctcttaca tgggagagaa 900 

cgtgagtggt ttccccaaac attccctaaa cccagagcca gagataaccc tctgcccact 960 

gcccagctca ctgggcattt gtcctaagag tcaggccaga ggctggagga gcagagagca 1020 

agttccagag ttttgttggg gtgactctgc ttgatatgac caagaacaat gccctccact 1080 

gacctccaaa gcatttaagc tggggtgact gccaggggtc ccttggaggg acaagggcag 1140 

ttgtccagtt acagggggac tcctcctgct cacctcttct tgtagtcctc cactacgtcc 1200 

cgcacattcc tcagctccga gtccagcctc accctgtccc cagacagcgt ctccagctgc 1260 

ttccgcaggt tgctgatgta gcc 1283 



<210> 67 
<211> 1263 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1256) 

<223> n equals a,t,g, or c 
<400> 67

gaggagatcg ccacctccat cgaacccatc cgcgacttcc tggccatcgt tttcttcgcc 60 

tccatagggc tccacgtgtt ccccacgttt gtggcgtacg agctcacggt gctggtgttc 120 

ctcaccttgt cagtggtggt gatgaagttt ctcctggcgg cgctggtcct gtctctcatt 180 

ctgccgagga gcagccagta catcaagtgg atcgtctctg cggggcttgc ccaggtcagc 240 

gagttttcct ttgtcctggg gagccgggcg cgaagagcgg gcgtcatctc tcgggaggtg 300 

tacctcctta tactgagtgt gaccacgctc agcctcttgc tcgccccggt gctgtggaga 360 

gctgcaatca cgaggtgtgt gcccagaccg gagagacggt ccagcctctg atggctcgga 420 

gatgatggac cgtggaaggg aagcgtctgt ggggagtgag cgcttagatg gccagcagct 480 

gctccttctg ggaagctcgc accttggcaa cagaacagcc ctctagcaga gcgtcagtgc 540 

agtcgtgtta tcccggcttt tacagaatat tcttgtccta ttttagaatt ttccggagta 600 

gtttatttgc agtctgttga ttatgtgcag tagacccggg acactgcgtt ttaccgatca 660 

ccttgaatgt ggtgcctgga tgtgcctttt ttttttttcc ctgaaattat tattaatttt 720 

ctattgtgag ttcatcagtt catagttttt ttagtaaaga agcaaaatta aaaggctttt 780 

aaaaatgtac aacttcagaa ttataatctg ttagtcaaat atttgttatt aaacatttct 840 

gtaatatgaa gttgtaatcc tggccgtgag cttggaagct tacttttgat tcttaaagcc 900 

tatgttttct aaaatgagac aaatacggat gtctatttgc cttttattgt aacttttaaa 960 

tgaaataatt tcatgtcaat ttctattaga tatatcactt aaaatatttg gttttaaatc 1020 

acaagaatat gtattcttta ataaagataa tttatgatca tggtataatt aattgaaatt 1080 

tattaaaatc tgtttttatt aaaaaaaaaa aaaaaaaaac tcgagggggg gcccggtacc 1140 

caattcgccc tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 1200 

ctgggaaaac cctggcgtta cccaacttaa tcgcctttgc agcacatccc ctttgncagc 1260 

ttg 1263 



<210> 68 

<211> 1617 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1578) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1586) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (1605) 

<223> n equals a,t,g, or c 



<400> 68 

tcgacccacg cgtccgggaa acctgatact gcaacctgca atgtaggatg tttgtatggc 60 

atttaaaggt aatggtgatg tttattattc tatactttgc atactgtgag agtaattttc 120 

actctgtctt aagtgtgagt aagcctcttc taaaaatctt gttcttgcca agaaatttat 180 

aaatcacata cgaagacgtc tgttgctaac agttaacttt atgaggtaac tatatccttc 240 

tatttctctg gactcatttt taaaaaatat gccgaatact gcatactgtt taaggtagta 300 

tataagttta tgagagaagt ggagagcttt cttccttgaa aagtcggtat ttgttgagat 360 

accatttgcc tcacagagag gtgttcccca ctcccatccc cattgccaga taaataaata 420 

ttttgagaaa agtgacctaa aacagctgga aatcttaggt gcatctgtct gcagacctcc 480 
ttaagcaggc tgtatcttac aattccctta ctgcactggg taagtgttaa cttagttttt 540
gttttgccct ttgctttaaa tattctccaa attaccattt atgcaacatg gttagggtta 600
atactgcatg gtattcattt acttgtttca tgaactttcc agtactgtac aaggtcaaca 660
aagtaatgcc tgtggtatcc tcatctctca cttttttact ctgtggtttt agcacagtaa 720
ggtactgcaa agaccttcct tccaaatgtc tccttgactt tattccttgg gccaattcag 780
tatcctcaac atcctaagat tttgttgttt tatcactgac ctgtggttgg cctgttttat 840
tctaatttcc agaaaagttc aatcccagta tttgcaatat caaataactc taaaaccgat 900
gttgtgattc taccttcctt actattttta ctgggcaaat gccctatttt tttaattatt 960
attattttta acttttggga cacacaaaaa tcagcaattc tcatgaagcg tttgttagtg 1020
tggcagactt gtctaattcc tgaaactcat tcatcccctt gagccagcca atggggagga 1080
ataggataat gcaaacacat gttttgtttt ctcattttca aataatttac catgttaaaa 1140
taaacttttc tttgtttttt atttgtagag tcagctaagt acccatattt aaatgccgtc 1200
tttattattt ttttgaggtc tttgtttttg tctgtttttg ttttgttttg ttttgtaaat 1260
aaggtaactg ggcaatcaaa caccttttgg ggattctggc tttagtattt tatcagccat 1320
tttaaaatta aatataaaaa tcctttgtaa gaaacttgca tcctaatttt tctttattgc 1380
aattgaaagt gtaaataata agacaatgta agtaagacct tcctaatgtc taatacaaac 1440
tgggcttcag caagtggcct atttttatta gggttttgaa aggttgtgtg tgtgtgtgcg 1500
tgccgtgtgt gtggttttct tttttaaatg gatagtagag tggtggctgg ataagggtac 1560
ctgtaatggg ggtttggnca gcaagnctga aatttatact tttgnaaata aaactac 1617



<210> 69 
<211> 1389 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (755) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1177) 

<223> n equals a,t,g, or c 
<400> 69 

gctttfettag gcattcattg gacacttgct ttaataagaa tagtttctta gttggcataa 60 
tgcttctgaa gtaatggtac tttaaaataa tttatctcat gaactttaag cattcctctt 120 
gaaatcttgt ttttactcta tctagtcagc ccttataggc aatccaaagg gatgctttgg 180 
atgctttagt ccagtggttc tcagagagtg gtctgtggag tcctggaagt cgctgagacc 240 
ctttcaggca atttgcaagc tcaaaactaa tttcagaatg acaccaggat gttctgtgcc 300 
ttttttgctg tgttggctat ttgcactgat gatgcaagag aaatggggag gagtaaaatc 360 
actggtgtct taccactatt caagacagtg gcaccaaact gtagtagtct agtaggcatt 420

gtatcctgca cagcctcccg ccaggagtgt gatggaagac caaggaagca gtgtaaatgg 480 
aggtacacgg gaagcacttc ctttgcaccg tgaaggatga tgactgtttt aaggaaaagc 540 
acttatgcca tggtttgtgt cgtgagctga accagctgcc tttttcatgg aacatgactt 600 
ttacttgaaa gagtaactga cagaaaacca attattctga cttgtgtttg tagcagacat 660 
tttcttgaaa atgaagaagt gggtctgtga cttcaggaaa atgtctgaca gtatctgttg 720 
ccaaagaaaa tgtgagtttt caagccaaga atagnaatct agaaaacttg tattcmcctc 780 
catgggcttg atgmcctctt agtacttaga cctttctgac gagataagcg gtggcattaa 840 
caaatgtgac gtttttcatg ttatctaaat acatttgtca acatttrraa gatctgcaca 900 
actccctgga ccaggattty ccaaatgatt gttgcttttt gttacaaaat cagggaatag 960 
atagaagatt cattcaaata atgaaagata gactgatgga tttttatgta acagaatagg 1020 
aaaagtttat gacatgtttt cagattccac cttgcaacta atttttaaga agctaccact 1080 
tgtcagccag gcacagtgct cacgcctata atcccaacac tttgggagtc caaggtgggc 1140

agatcacttg aggtcaggag ttcaagcctg gccaacntgg tgaaagactt ctctactaaa 1200 

actacaaaaa ttagctgggc atggtggtgg gtacctgtaa tcccagctac tctggaggct 1260 

gaggcaggag aattgcctga gcctgggagg cagaggttgc agtgagccaa gatcgcgcca 1320 

ctgcacacca gcctgggcaa caagtgcgaa actctgtctc aaaaaaaaaa aaaaaaaaaa 1380 

aaactcgag 1389 



<210> 70 

<211> 1896 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1802) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1856) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1886) 

<223> n equals a,t,g, or c 



<400> 70 

aaaacaaaaa agctaataat ctcctcaagc aatttctggc ctaatagaat tatagtagac 60 

agtgaagtat ctaaacccag ggaatcagat tgaggcacca tgtccatcgc cttgagaatt 120

aataggctgc atttctgggt tctccttttt tttttttttt ttgcccaact gagtctttct 180 

gtggacttac atggaacttc ttattctctt aaatcattaa gttacttgac aatattcttg 240 

gatttggaga aactggatgt agggccgtat gaaaaaatca ttcgaaatca gatttagggg 300 

tataaggttg gataggaatg ttttagaaag aagaatgtaa ggcagataac taatttgtca 360 

catccaaagt ataaaactgc tactttttcc ctagaaaagg gaagctcatt ttaggcagcc 420 

taaaccagta agattttctt cctcctccaa gtgcagattt ttgtaccttt cgtttgtcaa 480 

aacattcttt ggccctatgc atgccagagt gatatagaaa ggaagttacc acattttttt 540 

gagaacaaat cactcctgat aaaatttctt agacaattga taatcatttt aagaagaaat 600

ttaattgtat ttagctctgt gtctcgcccc tttggtgtca ctcttctacc tcttccatca 660 

ctatagctaa atatttagaa gtatatcttg acacctagca caaatgtttt ggttaagtat 720 

cttaaaactg atggatggta tggctggggc agcatggctc acgcctgtaa tcccagcact 780 

ttgggaggcc aaggcgggtg aatcacctga ggtcaggagt ttgagaccgg cctgaccaac 840 

ttggagaaac cccgtctyta ctaaaaatac aaaaattagt csggtggtgg cgcatgcctg 900 

taatcctgtc tactcaggar gctgargcag gagaattgcc tgaacccggg argcarargt 960 

tgcartgagc tgaratcgtg ccattgcact ccagcctggg caacaagagc aaaactcagt 1020 

ctcaaaaaac aaaacaaaaa acctgttggt atagtacgaa agaaacgtct tgcagttttc 1080 

tgttgcagag aattaattag aaccaacctg ttggattata cacattcacc tttcagaatc 1140 

ctttcttctc tgtggaaacc cacactctca gcagtgtgtg ggaacacagt agattcttaa 1200 

ggaatgcttg ttgaatgttg cagtctgcat cttcttgaag taacagaact gttggtagct 1260 

gtttaaaagt aaaatgtgtc taaagacctt ttggaaatta agatgtaaga gattaatgca 1320 

ccaaagcagt ctcttaatta cttaaaatga attatttcaa agaatcttta attgaatttt 1380 

ctgtgaagtc tggaatttgt aaattatgtc cctttgttca aaccagcccc tgaaaagaac 1440 

aattaaggca attaagatag cattaaagtt ttcaatgaag ttggcatttt cygtgtatta 1500 

agattagatg ttagctgctg aagtttgtgg aggtcggaca taaagcttcc aacatcagta 1560 
atgcaaaatt gtcttgaacc tgcgataaaa ttttgttgga cttttttttc attgcagtgr 1620

aaagggccat gtagcatgcc tcaaagccag gttactcagc ctagtccttg tttaagcagt 1680 

tttgatattc atycaagttc aattttcyca cctgatttwa kgattaattt cctkggaaaa 1740 

attttgaaaa gttttccaaa gaaagtaaaa aatttaaata atccggtaac cccgtataat 1800 

angaggatta aaccttccag gttccaaatg gttttgggtg gcaattttcc cttccnaagg 1860 

tccctttttt ttcccaatgg gtaatnaaat aattaa 1896



<210> 71 

<211> 308 

<212> DNA 

<213> Homo sapiens 



<400> 71 

ggcacgaggc ggcgctgcga ggacccatgc agctgacgct ggggggcgcg gccgtgggcg 60 

cgggcgccgt gctggccgcc agcctgctct gggcgtgcgc cgtgggcctc tacatggggc 120

agctggagct ggacgtggag ctggtgcccg aggacgacgg gacggcctcc gcggaaggcc 180 

ctgatgaggc gggtcggccg ccacccgagt gagcgacacg gccgtggggc ctggcaggcg 240 

ctggacagcg cccgaggact gggacattaa acctgacctc ccctcctcca aaaaaaaaaa 300 

aaaaaaaa 308 



<210> 72 
<211> 1688 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (912) 

<223> n equals a,t,g, or c 



<400> 72 

acccacgcgt ccgctcatgt ggacttatgc cagtctagag gcagaatcag aaggcttggt 60 

tgaacatatc gctttccctt tttcctctcc ctccgcccct cccagtacag tccatctttc 120 

aatgttgcag cctggttgag aaggagagaa aaaggtggca ggaatttcca ggagatcccc 180 

aagaatgctg ccttgtctgt ggacaaagat ggaccatgtg cccttcggaa ttagggatag 240 

aaacaaatat tgtgtgctct taacgattaa gctgtgttat ggtgggtttt caggttttta 300

ccttttttct ttaccccttt actctgcaag aatggggaaa gaatgcatac tgcgaaaatg 360

agtcttttaa attctgtctg cctactagtt ttaagtatat ggtatgttgt aaaatttcca 420 

atgatgagag acagcacaat aaatgtacct tatctcctta ggctgaaggc cataactaca 480 

tagtggagta atttaagaac tctcttgcct tcaccaaccc aaaaggttgc tttttgatag 540 

caactggcta atgaattttt aaaaagagaa gaaaaatact agttttcccc tcttttggga 600 

aatagatttt aaatggctaa actactagcc ttaaaactac tagtctataa atcaactacc 660 

acttttgtga atctgacagg ccacattttt atatggccct ttacagaatg gagtgtgttg 720 

aacaggatac taacgccatg gagttgagct gggcctagcg atggagggac actctaacac 780 

aactttccct cagctattat gcaacagatc agggaaaaag atgggatgac agatggggtc 840 

agacagaaag agcttctggg aaacaagctt acatagtctt ttttaaaatg cacaaagcct 900 

cccagctaag angtcacttg gtttgggctt cattaggact ggagactttg ttggagttct 960 

ttctgggaac ttggagagtg gatgatattc aggctctgaa acattcccag cgctctcccg 1020 

agggtgccac tttctcaaga tgaaaactgt gactgaaaaa attaataata aatgtttctg 1080 

agctgcctgt gttctccctg tgtgggtgag agaagggact agactcctaa gcctgcctca 1140 

gatacaagag ggatcattgg ctccaatttt agagaacttg aaagcaaggc tttggacaaa 1200 

attttgagac cctaatcact ttaccttcct ccaaattacc caacatacgg taaacaacat 1260 

ttgtgcagaa gtatgtatgt atttagttca ggttgacttg tgtccttata aactgttact 1320 
caaatgattt gaacttttat gcgactggga tttttttttt ccaaagctac aagcatggcc 1380
gcctgtggta tcgaggtgtt gcaaacaata tctgtgttgc gcttcctgtt ttaacctacc 1440
tcgttttgtt tgtttttgtt tcactgttca tcacagcagt gttatctcca ggagacatat 1500
agagagctca accggcaatc tcaggtgcat ttaacatttt taaaacgaaa cagtagttga 1560
ccaaattttt cttcttaaaa aattggaagt ggggggaatc caatgacaaa aactaatgtg 1620
gcttgtttct ggagaaaata attactgtaa atggaacaac aacaacaata aaacacacgt 1680
taaacatc 1688



<210> 73 

<211> 1138 

<212> DNA 

<213> Homo sapiens 

<400> 73 

gggcgcctgt agtcccagct attcaggagg ccgaggcagg agaattgcct gaactcagga 60 

ggcggasttg cagtgagccg agatcgcgcc attgcactcc agcctgggtg acagagtgag 120 

actctttctc ccaaaaaaaa aaaaaaaaaa aaagtcaaat gcagctggga atgtggttcg 180 

tgcctttttg tatattaacc atttgaaact tggttgtaag gtggggttgg caatgtcagg 240 

cctggctgca gcagctcatg tctttagagt gtgcctcttc cctctctcgt ggggctcgag 300 

caagactacc ttcatacatg ggctctccag ttacatagca actccagtgt taaattccat 360 

cttttcttcc tggaaaagcc gtagaaagga cacctggaca tgcctgctgc acaggttgtc 420 

tgccttcccc atcagccsca gaaggaggaa ctttgctctc ttctctcaca gctgtgtgtg 480 

cataagaagt agttcggatg atgtgggtcc caccatgtat tccttctctg ttccatgtag 540 

agtaaaataa atgggagttc tgtttaatgc atcacctcgg ttcatattgc atttgccaag 600 

aaagtgcaat tttattgaac attaggattg aattcttaac tgagtaatca atttcagtag 660 

taagttaaaa tgccttctat taatggacaa ctgcaaccgt taatcagagt tacagtagat 720 

taacagttgt cagcatttat gctaatagca ctaataaacc gtgggctcat gatttgcact 780 

ttataattcc atatttctca aaacagttgg taatactttt tgcttgaagg tattgattct 840 

tttgtccctt tgcttgctac ttggagatgt agagaaagct aaatgacatt ttcacggtga 900 

tgacacaata tcaccttctg cttttgcaca cttggctttg tgtcaaaata gatggaaagg 960 

gttcatttgt tctggtgctc tactgtttaa tttgatctgg tgtgtgacta aagcaagaca 1020

aatagtattt ttaatgaaac catttaataa cctctggtag cttagagtcg aaggcattgg 1080 

aaaaatgcaa ttaaaggatg cctagatgta aacaaaaaaa aaaaaaaagg gcggccgc 1138 



<210> 74 

<211> 777 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (761) 

<223> n equals a,t,g, or c
<400> 74 

gtagcacctt gaaattgggc agttctgaat atgctgttga gtaaagggaa aatcactatc 60 
tttttaggac ctttggaatg tggctccatg catttgctaa cgttgttctc ttcagggctg 120 
atttttctgg gctgttctac tcctctatcc ttctgtgatt gtcttccaat tcttttatta 180 
tggttagagt tccctgtaga aaccagtggg gtgtgtagtt aacaagtgtc aaaaggagta 240 
gaataattac tttatgtgat gtacttacga gaatactact tagtagaatc cagtataacc 300 
aaacaaaatg tagacgtatt taactatcca gtgtgtcagc tacacatttt ctcatcttta 360 
tcacttctgc tctggatatt gtgatctact ttcctatctc tttgtatttg tkttttcaca 420 
tttctttttt gtgtaaactg ctctatactc ttattgaaac ttgagcataa ttttatattt 480 
acatagtaaa gtttctgata tgattagaac ataagttgtw tctcctaatt ttccaataga 540

gccattgatk tttcatttta acccctttta atggaacact tactagcttt ctaaattata 600 

aacttactaa ctttaaatat acatgaatta tttctcagca ttcttaaaac aagggtactg 660 

aacttcgwtt tcttgatgat tcaggggaaa agaattctga gtgttggaaa ggactttaga 720 

tgtcttccaa ggatttatgg gatcaattta agaaaaaaaa naaaaaaggg cggccgc 777 



<210> 75 
<211> 1060 
<212> DNA 

<213> Homo sapiens 



<400> 75 

gatgtatttc cttaatatgt agtttcagaa gtggaattta ttagagttaa actaaactca 60 

ttaaatttag agtttcttat tgtctttcat gagaacattt ttccttttca ttcataaatg 120 

atattgaaac actatattct tactttcata ttcctgttta tatttttgtt tttcatgtta 180 

aacattttac attctaatag taacctcatc gacctgttaa aaggcaatat aagatttaga 240 

ttattaaata gcatgtaata tatgtgatca gtaatcttca atgagcttgk tcttcattta 300

attgcaacgt tatgtctgat tttttttgkt gcaaagcttt cagaatcttg acttgtggta 360 

atcttctttt aaaaaagctt ttaacagaat taatargtca tcacgttatg ataaatgatt 420 

aaggaaatga tgcctctaat acatkgaatt attaaaacta tcattttgaa aaattatatt 480

ggtacaaact agtgtctact gctattactc atacatttca gaattcatac atggatatcg 540 

tctaggattt tttttttgcg taatcatgag ttacggtgtt aaagttatag tgttaattta 600 

attatgttat agtgttaatt tatctgtttt acatctcact tttgtatctg aaaccgttcg 660 

aaaataatta ttattaaagg ccagttgaca aaatttccac tcctcctccc cagtgtgact 720 

ttccttattt gtattatacc tataaagact acctcttaca tcggccaggc acagtggctc 780 

acgtctgtca tcccagcact ttgggaggac gaggtgggca gattgcctga gctcaggagt 840 

tggagaccag cctgggtaat atagtgagat cctgtctcta caaaatatac aaaattagct 900 

aggcgtgcct gtagtcccag ctacttggga ggctgaggtg gtaggatggc atagagtcca 960 

ggaggcagag gttgcagtga gctgagatgg tgccactgca ctccagcctc agtgtcagag 1020

ccagtccctg tctcaaaaaa aaaaaaaaaa gggcggccgc 1060 



<210> 76 
<211> 1503 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (6) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (41) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 



<222> (1501) 

<223> n equals a,t,g, or c 
<400> 76 

gtggangccg ctcctganaa ctagtgggtc ccccgggctg ncaggattcg gcacgagaat 60 

gaatggcaaa gaaatagaag gggaagaaat tgaaatagtc ttagccaagc caccagacaa 120 

gaaaaggaaa gagcgccaag ctgctagaca ggcctccaga agcactgcgt atgaagatta 180 
ttactaccac cctcctcctc gcatgccacc tccaattaga ggtcggggtc gtggtggggg 240 

gagaggtgga tatggctacc ctccagatta ctacggctat gaagattact atgatgatta 300 

ctatggttat gattatcacg actatcgtgg aggctatgaa gatccctact acggctatga 360 

tgatggctat gcagtaagag gaagaggagg aggaagggga gggcgaggtg ctccaccacc 420 

accaaggggg aggggagcac cacctccaag aggtagagct ggctattcac agaggggggc 480 

acctttggga ccaccaagag gctctagggg tggcagaggg ggtcctgctc aacagcagag 540 

aggccgtggt tcccgtggat ctcggggcaa tcgtgggggc aatgtaggag gcaagagaaa 600 

ggcagatggg tacaaccagc ctgattccaa gcgtcgtcag ccaacaacca acagaactgg 660 

ggttcccaac ccatcgctca gcagccgctt cagcaaggtg gtgactattc tggtaactat 720 

ggttacaata atgacaacca ggaattttat caggatactt atgggcaaca gtggaagtag 780 

acaagtaagg gcttgaaaat gatactggca agatacgatt ggctctagat ctacattctt 840 

caaaaaaaaa aattggctta actgtttcat ctttaagtag cattttgctg ccatttgtat 900 

tgggctgaag aaatcactat tgtgtatata ctcaagtctt tttatttttc ctcttttcat 960 

aaatgctctt ggacattatt gggcttgcag agttccctta ttctggggat tacaatgctt 1020 

ttatcgtttc aggcttcatt ttagcttcaa aacaagctgg gcacactgtt aaatcatgat 1080 

tttgcagaac ctttggtttt ggacagtttc atttttttgg atttgggata gattacatag 1140 

gagtatggag tatgctgtaa ataaaaatac aagctagtgc tttgtcttag tagttttaag 1200 

aaattaaagc aaacaaattt aagttttctt gtattgaaaa taacctatga ttgtatgttt 1260 

tgcattccta gaagtaggtt aactgtgttt ttaaattgtt ataacttcac acctttttga 1320 

aatctgccct acaaaatttg tttggcttaa acgtcaaaag ccgtgacaat ttgttctttg 1380 

atgtgattgt atttccaatt tcttgttcat gtaagatttc aataaaacta aaaaatctat 1440 

tcaaaacaww aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1500 

1503 



<210> 77 
<211> 872 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (844) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (858) 

<223> n equals a,t,g, or c 
<400> 77 

ggggaagttc ttcactgcct tgcatttgac tccagatccc tccatcctcc cagagccttg 60 
gcctcaaaaa tgctgattct agcatcatgg aaatgctgtc ctcaaagtgg tctaaacggg 120 
ttgctgcttc acttgctcac ttaatctccc ttttcatagg gctgttgttt ttacttctgg 180 
gaagttctgt ttaccctgga acagaaactc tcttccctaa aagttgattt tattgaccca 240 
tggaggccag agacacttag gcatattttc cctccagact agaagcttct gaggaggacc 300 
tcctgagtct gcaccctggc tccctgctgt gctgagggcc cccgtgttaa cctcacgttg 360 
tgcctcctct gattcagagg gcccagtgtg gttctgtcag ccaggcagtg gccccagctc 420 
tacagaaatg agttgtcatt gcatcctagg gccagggtct tcgtgcttgt gtgtgttacg 480 

tggaagtatg tggacaccaa gtgttcctgg atggccacag cctgcgaagg aaactggggc 540 

cagcagctgc tctgtgtttt cagccaacaa tggctcctgc ccactgccgc tgcataacca 600 

ccagaggcag gcttctcttg acacaggcct gtcgttggag catgtgcctg gcgagtccta 660 

tttctattcc cctgtgggtt agggacaggc agctgtacct tcagtgtgtt gctggggcag 720 

gagaatcgct tgaaccggga ggcggaggtt gcagtgagcc aaaattgcac cactgcactg 780 

cagtctgcag gacagagaga ggctmtatct caaaaaaaaa aaaaaaaaaa actcgagggg 840 

gggnccggga cccaattngc catataggaa aa 872 



<210> 78 

<211> 573 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (563) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (566) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (567) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (571) 

<223> n equals a,t,g, or c 



<400> 78 

gatcaagttc cttagttttg acaatcaggt cccaaactct ttttcttgcc tcatttattc 60 

attcaacaag tattttttgt gcactgaata tgtggccctc actaggcaga tgctgcctat 120 

tcttttgcct gttaactaat ttaacctctt gtcatacctc ccaaatcacc ttatgctcca 180 

gagaaacttg tgtatggtca cgtaccacat aatgatgctt tggtcaacta cagatagttg 240 

atccagtaag attaccatgg agctgaaaaa ttccaatggc ctagtattta ctgtgctttt 300 

aattattatt ttggaatgta ctccttttac ttataagaaa cttagctgta aaacagcctc 360 

agtcattttc ttcatgaggt atttcagaaa atggcattgc tatcatagga gatgacagat 420 

tcatgcattt tattgcccct gaagactttc cagtgggaca agatgtggag gtggaagaca 480 

gtgatattga tgatcctgac cctgcataaa tctagcttaa tatgtgtgtt tgtcttagtt 540 

gctgacaaaa aaaaaaaaan aanaannaaa naa 573 



<210> 79 
<211> 1509 
<212> DNA 

<213> Homo sapiens 

<400> 79 

ggcacgagga tgtacctaat gagcttctcc attcactttg taaaaataat ttgtatgtgt 60 

accatcttgg tcctctcccc tcccgttttg ttaaaatatc aggatagcac tcccaggcca 120 

ctttggtctc agtgtaagat ccctattaac tatctgaaag gaaaatagag ccaagacctc 180 

tggtctcaaa tatataggaa ttgcctttct ttagtcttca ggactattgt gtgaaaacaa 240 

gtaggggtct aatctcctag aaggtagggg ctttatcctt aaagagaata tgtccccaga 300 

ttattagcac ttttagagga gaagccaagg tatgtagggg tgtgtggctg gcccatcagt 360 

ggagcacgaa gagagaatgg gataccattg tgggaagaga agaaaagttc ctcaggggcc 420 

tcccactgct aaagtttttt gtgagatgtt gatctgtgct tcctggattt gacttttaaa 480 

ggaattattc tggcagcaca tgtagtattc ttggatgatc ttgctgctct tatttctcct 540 

tttgtgtgtg tgtgtgtgtg tgtgtggcta tgggttttca tttgtaactc catctgctta 600 

ggagagtggg ctctctataa gggaacctgc tgtaaacttc attgcagcaa ggatgtagag 660 

agaaatagga cttaattcca ctaggggctc tcatctcaca ccttaaggag gagatttcta 720 

gaaaaactgg gccagatttt ctttgttctc catcatttta atgtggcagg ctgttcagtt 780 

ttcttactct tacctatgtg atatttcttc gtaacgtgtc caaaaagaaa aaagacccaa 840 

tcagtgtctc ttgactttgt tctttgatcc ctcagtttct tcttgatttc agcatgtgtc 900 

gggttcctaa ttttgggtat gagttagcaa atttaaccat tgtgtttgtg ccctacccag 960 

gggactcccc agtttctgac ttgaagtaga ctgagaagaa tccacgaggt gctatctggc 1020 

cagatttaag tagattctat ttccttggtt ctccctctcc ctgaggacct cttattttat 1080 

tgtcccctct tctaggttaa ttctcctttg atttgacttt gttgagaagg aggttggaca 1140 

gtagattagc aaagttccaa gtgcaaaatt acagtgtgtt agagtgtggg gggaaaatta 1200 

gtcttatttt tccctacatg ggatacaaca ctgtgaattc aatcttcaac tgaaggccct 1260 

gcagttctcc taaaacatag ttgtttgttt ttctttaaca aagtttaagc tagtgttaat 1320 

aaattaaaaa aaattgcttg tctgtctact tcagctttgt tttatgccca tttcatattg 1380 

ttgtctgtgt tgtaattcat aacttttgat accatttctg atgtgtaaaa ttggttgtct 1440 

tgtaaatatc ttataaagag ttcaattgta aataaactat tgtggctgtt aaaaaaaaaa 1500 



aaaaaaaaa 1509 



<210> 80 
<211> 1109 
<212> DNA 
<213> Homo sapiens 

<400> 80 

ccacgcgtcc ggccgcagaa cgggctccgc ggacgacggg ctccagggac gcacaggcag 60 

cgggcctccc accgcgggtg ccgggggcgg gggggctgcc cccatgcggg gcccttcctg 120 

gtcgcggcct cggccgctgc tgctgctgtt gctgctgctg tcgccttggc ctgtctgggc 180 

ccaagtgtcg gccagggcct cgccctcggg gtccctgggc gccccggact gccccgaggt 240 

gtgcacgtgc gtgccgggag gcctgccagc tgtcggcact ctcgctgccc gccgtgcccc 300 

cgggcctgag cctgcgcctg cgcgcgctgc tgctggacca caaccgcgtc cgtgcgctgc 360 

cgccaggtgc cttcgcggga gcgggcgcgc tacagcgcct ggacctgcgc gagaacgggc 420 

tgcactcggt gcatgtgcga gccttctggg gcctgggcgc gctgcagctg ctggacctga 480 

gcgccaacca gctggaagca ctggcaccag ggactttcgc gccgctgcgc gcgctgcgca 540 

acctctcatt ggccggcaac cggctggcgc gcctggagcc cgcggcgcta ggcgcgctcc 600 

cgctgctgcg ctcactcagc ctgcaggaca acgagctggc ggcactcgcg ccggggctgc 660 

tgggccgcct gcccgctcta gacgcgctgc acctgcgcgg caacccttgg ggctgcgggt 720 

gcgcgctgcg cccgctctgc gcctggctgc gccggcaccc gctgcccgcg tcagaggccg 780 

agacggtgct ctgcgtgtgg ccgggacgcc tgacgctcag ccccctgact gccttttccg 840 

acgccgcctt tagccattgc gcgcagccgc tcgccctgcg ggacctggcc cgtggtttac 900 

acgctcgggc cggcctcctt cctcgtcagc ctggcttcct gcctggcgct gggctctggg 960 

ctcaccgcct gccgtgcgcg ccgccgccgc ctccgcaccg ccgccctccg cccgccgaga 1020 
ccgtccagac ccgaaccccg atcccgaccc ccacggctgt gcctcgcccg cggacccggg 1080 

gagccccgtc cgctgccgcc caagcctga 1109 



<210> 81 
<211> 807 
<212> DNA 

<213> Homo sapiens 
<400> 81 

cccacgcgtc cggacgtcct gatagatcct ctgctccaat aggcaactcc ggccttccct 60 

gccctgacct ggaacctctg ggagggctgc agagtaagtg ccgcctctgc gctccgacgg 120 

aggcacgagg cctgtggagt aggtccctct gttccgacag gtgcgacact tggcgctcca 180 

tgcttgcggg tgccgggagg cctggcctcc cccagggccg ccacctctgc tggttgctct 240 

gtgctttcac cttaaagctc tgccaagcag aggctcccgt gcaggaagag aagctgtcag 300 

caagcacctc aaatttgcca tgctggctgg tggaagagtt tgtggtagca gaagagtgct 360 

ctccatgctc taatttccgg gctaaaacta cccctgagtg tggtcccaca ggatatgtag 420 

agaaaatcac atgcagctca tctaagagaa atgagttcaa aagctgccgg ttcagctttg 480 

aatggaacaa cgcttatttt ggaagttcga aaggggctgt cgtgtgtgtg gccctgatct 540 

tcgcttgtct tgtcatcatt cgtcagcgac aattggacag aaaggctctg gaaaaggtcc 600 

ggaagcaaat cgagtccata tagctacatt ccacccttgt atcctgggtc ttagagaccc 660 

tatctcagac agtgaaagtg aaatggactg atttgcactc ttggttcttt ggagccttgt 720 

ggtggaatcc ccttttcccc atcttcttct ttcagatcat taatgagcag aataaaaaga 780 

gtaaaatggt aaaaaaaaaa aaaaaaa 807 



<210> 82 
<211> 1043 
<212> DNA 

<213> Homo sapiens 



<400> 82 



ggcacgagtt gggccgggca cccccagaag ctgaccttga gacaaggatt tgggtgcaag 60 
tggtttattt ggcaggtgcc cagaaagtgc tgacaggagt gggaaagtga gttaggggag 120 
agaaggaagc cactacaggc tatgttcatg tgcaggttac tgctgtgggc aactggggct 180 
tacggatttc taggagatga cgtggaatac acctcagtgt tgccccacca gaagggcaag 240 
gaagcatggg tatttatatg tcagctccca ttcattattg gctgagggca gctcctagag 300 
ggcattgggt ctgcgtttca agcctgctgc acataggctg agaggaatcc ctgagttcga 360 
gtcacaggcg cccacagtca tgctcagaca gcacatacag gaacagtgac tgcagggggc 420 
ataggtggga cacaaatacc accagttata aagaggaaag atgggaagga aagacaagag 480 
gaaggtgtgg agttagattc ctgggtcaga tgtgaacccc tggctctcaa aacactcctt 540 
ctttttttct ttttcttttt ttttgagaca ggatctcact ctgttgcaca ggctagagtt 600 
cagtggtgta atcagggctc gtggcagcct ctacctccta ggctcacatg atcctcccac 660 
ctcagcctcc tgagtagctg ggactagagg cacacatcac cacacttggc tagtatttaa 720 
atttttctgt agaagtccag gcgcagtggc tcatgcctgt aatcccagca ctttgggagg 780 
ccgaggcagg tggatcacct gaggtcagga gttcaagacc agcctggcca acatggtgaa 840 
accccgcctc tactaaaaat acaaaaaaat tagcctggtg tcgtggcagg ctcctgtaat 900 
cctggctcct tgggaggctg aggtaggaga atcacttgta cccagaatgt ggagcttgca 960 
gtgagctgag atcatgccat tacactccag cctgggcaag aagagtgaaa ctccatcgca 1020 
aaacaaaaaa aaaaaaaaaa aaa 1043 



<210> 83 
<211> 1173 
<212> DNA 
<213> Homo sapiens 
<220> 

<221> SITE 
<222> (548) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (603) 

<223> n equals a,t,g, or c 



<400> 83 

gctgtctcag aaaaaagaaa aaagtttcta aagtaaaaat tgaaagtact tcccctacaa 60 

ccacaggttg ctttgacaga ttaatgtaaa ttcttccaga tactcttctg tggatgtaga 120 

aacatgcaga atgaggcaag ctttaatttg cttatgtcac ttactgtgga tagcctttca 180 

tatcttataa gttaatgtca gagcagcaat ctcatttttt tccaatttgt aaacatttta 240 

tttaacctta tgatggatat tttggtggat ttcagtatta caaaaatgcc tattaatagt 300 

atattttcat tatatttctg ttacgaaatt ataatgctac aaacattact atgcctgtgg 360 

cagtatacat ctgcacaagt tttgaaaatg ttatgcattc ataggcaaaa atgggataac 420 

ttttgggcag tggtcatgat taatctgttg atcagaatcc agagattgcc cttctccttg 480 

ccaattgctt taagagtacm ctagtttttg gccgggtgca rtggctcatg cctgtatccc 540 

agcatttngg aggccaagac gggcggatca caaggtcagg agatcgggac catcctggct 600 

aanttggtga ggccccattc tactaaaaat tccaaaaaaa cccacmaaaa ccaaaaaaac 660 

ccrgccttgk tggtgggatt acaggcatgt gcmacaacac ccggctaatt ttttttgtat 720 

ttttagtaga ggtggggtgt caccatgttg gccaggctga tctcaaactc ctggccttaa 780 

gtgatctgcc cacctcggac tcccaaagtg cggaattaca ggcgtgagcc accgcgcccg 840 

gccactggtt tttaaacttt attttgaaat tatttcaggc tgggcgcagt ggttcacgcc 900 

tgtgatccca acactttggg aggccgaggc gggcggatca cgaggtcagg agatcaagac 960 

catcctggct aaccccgtct ctactaaaaa tataaaaaat cagccgggca cggtggcagg 1020 

tgcctgtagt cccagctact cagtgggctg aggcaggaga atggtatgaa cccgggaggc 1080 

ggagcttgca gtgagctgag atcacgccac tgcactccag cctgggagac agagtgagac 1140 

tctgtctcaa aaaaaaaaaa aaaaaaactc gag 1173 



<210> 84 
<211> 1561 
<212> DNA 

<213> Homo sapiens 



<400> 84 

ggcacgagtg aggctcatgt ctgacctgca gaactgtata atgataaatt atgttgtttg 60 

aaaccgctac atttgcggta atttgttaca gcagcaatag aaaacgaatc ccctgcccag 120 

aatgacttcc tcctttcctg tcggacgaag gctcaggcct tctgctgaaa gcttgctccc 180 

ctaagtagtc acacccaatg ccgaatactc cccagaagca gctgctattt tctgaggaca 240 

atgagttgct tgtaagcctg agaacaggac gaaaacccac tttgcaagca gccctgcgtg 300 

tgacgggcat gccctcggag ggcaggttgg ttttgctatc tgctttctgt cctgcctttt 360 

tccccccatg ggtcctgtct ggctcttttg ctttctcact ttgtgcagaa agccatctca 420 

actcttctca caggagaata gctgtatgga cgtagcaggg ggagttacca catgcctacc 480 

tccatggttt tcgagagggg cccctgccca aatgtctcag tggccacctt catcagacca 540 

tggagcagtc agagcgggaa gggattctag agttggtcca gtccaaccat ctcatcttac 600 

atgtgaagga ggaaaggaag aaagggagaa aaataagaaa gctgaggtca accctcctac 660 

agggatgggc ctggccaaca ggatcccaag ggatgacata acattgaaat taagaaacca 720 

aggaaagttg agaactaaag aaaacagaac ccagtcagcc aagaggcatc cttgagggcc 780 

aaccaagccg tcaaacctgg atgcccccga cgagtcagaa agtcgggtgc ctcaagagcc 840 

aagcagccaa gaaatggggt ctggaactgg cactttggtc cgcctctgtg cactcaccca 900 

gaaagggtgg aagggaccct gggaccaagt gccaaggtca cacaacggat gaatagactg 960 

ctggacttca aactgaacat gccattttgc caaagcagtc atcaccttcc gtgaatcata 1020 

aatgtttgtt caaagccaca aatgtatata ctctttgtat gtatacagat tttttctaaa 1080 

ggttaacatc taaacagatc aattaaggtc agccttaatt tgtctgagct ttttggttaa 1140 

agtttcctga gtaattgagc gaattcaagt ttctggcttt ctcctttctc tttctccatt 1200 

taaaacatga tctcatgaaa tttttgtccc aagaaaggca ggattacatt ttcttttaac 1260 

agtttgagtt ggtgtagtgt attcttggtt atcagaatac tcatatagct ttgggatttt 1320 

gaattggtaa atattcatga tgtgtgaaaa atcatgatac atactgtaca atctcagtgc 1380 

cacaaaattg gatgttgtgc ctacacacgc acaggaccta gaagagcatg tcaaactata 1440 

aactgcctgt gattgtgaat gactttgttc tttgcttctt gcgtttttca gtttcctata 1500 

atgcacatct taacttttaa aaaataaagg ttattttaaa agccaaaaaa aaaaaaaaaa 1560 



1561 



<210> 85 
<211> 1433 
<212> DNA 

<213> Homo sapiens 



<400> 85 

cccggagccg tggacgccct acagctgaga aggggaccca aggggtcggc cgcggccaag 60 

gcccctagga ccgccgcccc agctcacgct gccgacggca ttatkagaca ttctgcgtca 120 

ggtccgggct cctggacttc gcctttcccg agccctggag gtggggagaa aaggttcacc 180 

aatttttaaa atccaaatat atctcatggt acagtggaag aactggccag agagtctgga 240 

agtttgggtt ctggtcctgg ctgtgccact gactcactgt gaccttggga tcttgtgctg 300 

tgaagacatt tcccaagtgc ttcatgttag ccagcaaatc tgacccacaa ggcctggaaa 360 

gaggtgattg ttaggttgcg cagaggtggt cttatccagc tcagcttccc ctgggaccca 420 

ccgtgggacc tgaggcagaa ctggggtgga cttggcctcc tccatggcac accggctgca 480 

gatacgactg ctgacgtggg atgtgaagga cacgctgctc aggctccgcc accccttagg 540 

ggaggcctat gccaccaagg cccgggccca tgggctggag gtggagccct cagccctgga 600 

acaaggcttc aggcaggcat acagggctca gagccacagc ttccccaact acggcctgag 660 

ccacggccta acctcccgcc agtggtggct ggatgtggtc ctgcagacct tccacctggc 720 

gggtgtccag gatgctcagg ctgtagcccc catcgctgaa cagctttata aagacttcag 780 

ccacccctgc acctggcagg tgttggatgg ggctgaggac accctgaggg agtgccgcac 840 

acggggtctg agactggcag tgatctccaa ctttgaccga cggctagagg gcatcctggr 900 

gggccttggc ctgcgtgaac acttcgactt tgtgctgacc tccgaggctg ctggctggcc 960 

caagccggac ccccgcattt tccaggaggc cttgcggctt gctcatatgg aaccagtagt 1020 

ggcagcccat gttggggata attacctctg cgattaccag gggcctcggg ctgtgggcat 1080 

gcacagcttc ctggtggttg gcccacaggc actggacccc gtggtcaggg attctgtacc 1140 

taaagaacac atcctcccct ctctggccca tctcctgcct gcccttgact gcctagaggg 1200 

ctcaactcca gggctttgag gccagtgagg gaagtggctg gccctaggcc atggagaaaa 1260 

ccttaaacaa accctggaga cagggagccc cttctttctc cacagctctg gacctttccc 1320 

cctctcctgc ggcctttgtc acctactgtg ataataaagc agtgagtgct gagctctcac 1380 

ccttccccca ctaaaaaaaa aaaaaaaaaa actcgagggg gggcccggta ccc 1433 



<210> 86 

<211> 1377 

<212> DNA 

<213> Homo sapiens 

<400> 86 

ggcacgaggt ccagtcctga ttccatcttc ttacaagtta gggagctggg tccaggcctg 60 
gatccatgtt attatgaatc aggaagttgg gtccaggcct ggctccatgt tcctgcaggt 120 
cagggcaggt cttcccccga gtgatggctc ttggactgtg ctcctctggg gccctctcaa 180 

ctctgtgtct gtcatctgtc acctgcctgg ccattatggt tttgatggca gtggatgggc 240 

tccatgggac ttcaggcctg gggtgagact caggaccctg gggtgggcat ggatggggat 300 

attggacccc tgaaagaagg gaagctgaga gacttttttc ctttaaagac ttttccatgt 360 

tatctccact cagagaattc ttttctgcaa agtcacggga gggaggtgac attgagccct 420 

ccaatgtgac agaaactgtg ctgggaactt tacatgtgtt acctaatttg tttaattatc 480 

ccagcaactc cacaaagtag gcatttttat tgttgaggaa acagaagctt agagactttg 540 

tgagacttgc ccgagacccc aggtcacaca ccagcaagga tgaggtcaag cttttaatcc 600 

aggtctgcct ggctccaagt ccacaccctt tcacaacaat gaactttctt tatgattgca 660 

gatattattt ggggaacttt acatcaaaca ttgactacat aaaacttcaa ccatagacta 720 

tattctttgt tttggaaact gtgaagactc aaatttttta taaactcaga acagcttcca 780 

gttttctcta gatatcggaa gatgggctgt gttttttgtc tgttgtccag tgaggctgat 840 

ttgtagtcag acaggtgagt cagtttggtt ggagtaggct attgtggttc tctctcatca 900 

ggaaagaggg gatgcacttg gcccctcaac tccaagttgg tggtgcgatg atttttccat 960 

attctccctt aacaggctgt gagggagtct gggccaggca ctaggccatg agcagggcag 1020 

actggggtaa acccttagcg agcctctctc cagccacgag gaaacctgga gtgtgtgcgt 1080 

gcctgtgtgc tgctggtgtg tgtgtgtgaa tgcacacgtg tgtgcatgca ctgtgagctg 1140 

gtgtgtgcat gtgcactggt gtgtgcgttt gtgtgtgtgt gtgtgtgtgc atgtgtgtgc 1200 

tgggtgcaca catgcatatg tctctgtgta tacatgtgta tgtgtgccag tgggtgcatg 1260 

tgtttgtaca gtgtgcgtgt gtgtgtgtgt ttgtgcacat gagctgctgc acacatataa 1320 

gccttgtgaa ttaggggaag aagaaaggct ccggcttaca aaaaaaaaaa aaaaaaa 1377 



<210> 87 

<211> 1715 

<212> DNA 

<213> Homo sapiens 



<400> 87 

ggcacgaggg acattggagc tccccacacc actcattgct gcccaccagc tatacaacta 60 

cgtggctgat cacgccagct cttaccacat gaagccattg cgaatggccc ggccaggggg 120 

cccagaacac aacgagtatg ccctggtgtc ggcatggcac agttctggct cctacctgga 180 

ctctgaggga cttcgacacc aggatgactt tgatgtgtct ctgcttgtct gtcactgtgc 240 

tgcacccttt gaggagcaag gagaggctga gcggcacgtt ctgcggctac agttcttcgt 300 

ggtgctcacc agccagcgag agctcttccc caggctcact gctgacatgc gccgcttccg 360 

gaagccaccc agactgcccc ctgagccaga ggctcctggg agttcagctg gcagccctgg 420 

ggaggcctca gggcttattc tagcgcctgg accggctcct ctgttcccac cactggctgc 480 

agaggtgggc atggcacgag cacggctggc tcagctggtg cggctggctg gagggcactg 540 

ccgtcgggac accctttgga agcgcctctt cttgctggag ccaccggggc ctgatcgact 600 

gcggctaggg gggcgcctgg ccctggcaga gctggaggaa ctcctagaag cagtccatgc 660 

caaatccatt ggggacatcg acccccagct ggactgcttc ctatccatga cggtctcctg 720 

gtaccagagc ctgatcaaag ttctcctaag ccgcttcccc agagctgtcg ccatttccaa 780 

agcccagact tgggaactca gtacctggtt gcgctgaatc agaagttcac tgactgctct 840 

gcgctagtgt tctggactcc acttaggaaa gacgtctctg aagtggtttt ccgagaagcc 900 

cttccagtac agccccagga cacgagaagc ccccctgccc aactggtctc cacctaccac 960 

cacctggagt ctgtcatcaa cacagcctgt ttcacccttc tggacccgcc tcctctgaag 1020 

ggagtggact ggaccactga atgtcactgt tccttgaatc atgggcctac cagattgcct 1080 

gccagaggca ggactgacca gcccttctgg gccccagggc aagccagaca ctgagtgaca 1140 

ccaaaggctt tgtaactatg tcttgagggt ctgctgcccc agcctggcag caggaaccgc 1200 

cctccccaaa cacccacagc cactgaccca tccaggactc cagagagtca ggtcaacccc 1260 

gaggacccct tgggcccttc tggggtactc ctttcggccc ccctggtaga gtctcgggag 1320 

ttcacacagg gtggcaaaca ccccctagag ctcctctgcc tgaatcctgc cccctagcct 1380 

ttgaccactg tcagccacct gtgtcccttg agccttcggg tcttcacttc ccacttggac 1440 

atcactgctg gacattccca tcgagatgac acctgggttc caatcccagc tctgcctttg 1500 

aagcacttgc ggccaccgtc aagtcccttt gctctcggac cctgggtttc tcatccttta 1560 
atgaggtggg ttcagaagct ctcccatctt cacagcaacc ctggcactgg cttctcaatg 1620 

ggagggaagt cagcagagaa actgaagtgt tagacactat gtgtcccacc accccattac 1680 

agagacatat gacaatgaaa aaaaaaaaaa aaaaa 1715 



<210> 88 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<400> 88 

ccacgcgtcc gctcctctag aggctccaca tgaagtccca gtgctacagt cctagttatt 60 
ttgccttctt ctgcctggtt ttctttcaga tcacctcagc cagttctcag acacttaggg 120 
gacatgttct ctgcaggacc actctgaggg actcttctgc atattgctga cctgagagga 180 
tggcctcaga gctgacttgg gcaatcctcc ccaacaggaa ggggagacat tgcctgccac 240 
tgaggaaaca ggtcatgaag gtggagataa gctgcaaggg gcgaagcaac tttatgtcag 300 
tggaaaacgt gtctctttaa agctgctatg tgaacagctt ttacagtcat taaatttacc 360 
taaactaagg ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 417 



<210> 89 

<211> 1167 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (432) 

<223> n equals a,t,g, or c 



<400> 89 

gggggtgggg caggcgacgg tggggaagat ggcgtaccag agcttgcggc tggagtacct 60 
gcagatccca ccggtcagcc gcgcctacac cactgcctgc gtcctcacca ccgccgccgt 120 
gcagttggaa ttgatcacac cttttcagtt gtacttcaat cctgaattaa tctttaaaca 180 
ctttcaaata tggagattaa tcaccaactt cttatttttt gggccagttg gattcaattt 240 
tttatttaac atgatttttc tatatcgtta ctgtcgaatg ctagaagaag gctctttccg 300 
aggtcggaca gcagactttg tatttatgtt cctttttggt ggattcttaa tgaccctttt 360 
tggtctgttt gtgagcttag ttttcttggg ccaggccttt acaataatgc tcgtctatgt 420 
gtggagccga angaacccct atgtccgcat gaacttcttc ggccttctca acttccaggc 480 
cccctttctg ccctgggtgc tcatgggatt ttccttgttg ttggggaact caatcattgt 540 
ggaccttttg ggtattgcag ttggacacat atattttttc ttggaagatg tatttcccaa 600 
tcaacctggt ggaataagaa ttctgaaaac accatctatt ttgaaagcta tttttgatac 660 
accagatgag gatccaaatt acaatccact acctgaggaa cggccaggag gcttcgcctg 720 
gggtgagggc cagcggcttg gaggttaaag cagcagtgcc aataatgaga cccagctggg 780 
aaggactcgg tgatacccac tgggatcttt tatcctttgt tgcaaaagtg tggacacttt 840 
tgacagcttg gcagatttta actccagaag cactttatga aatggtacac tgactaatcc 900 
agaagacatt tccaacagtt tgccagtggt tcctcactac actggtactg aaagtgtaat 960 
ttcttagagc caraaaactg gagaaacaaa tatcctgcca cctctaacaa gtacatgagt 1020 
acttgatttt tatggtataa gcagagcctt ttcttcctct tcttgataga tgaggccatg 1080 
gtgtaaatgg aagtttcaga gaggacaaaa taaaacggaa ttccattttt ctctcactgt 1140 
aaaaaaaaaa aaaaaaaggg cggccgc 1167 



<210> 90 
<211> 1892 
<212> DNA 

<213> Homo sapiens 



<400> 90 

ccacgcgtcc gcgggaccgg acggatcttc tccggccatg aggaagccag ccgctggctt 60 
ccttccctca ctcctgaagg tgctgctcct gcctctggca cctgccgcag cccaggattc 120 
gactcaggcc tccactccag gcagccctct ctctcctacc gaatacgaac gcttcttcgc 180 
actgctgact ccaacctgga aggcagagac tacctgccgt ctccgtgcaa cccacggctg 240 
ccggaatccc acactcgtcc agctggacca atatgaaaac cacggcttag tgcccgatgg 300 
tgctgtctgc tccaacctcc cttatgcctc ctggtttgag tctttctgcc agttcactca 360 
ctaccgttgc tccaaccacg tctactatgc caagagagtc ctgtgttccc agccagtctc 420 
tattctctca cctaacactc tcaaggagat agaagcttca gctgaagtct cacccaccac 480 
gatgacctcc cccatctcac cccacttcac agtgacagaa cgccagacct tccagccctg 540 
gcctgagagg ctcagcaaca acgtggaaga gctcctacaa tcctccttgt ccctgggaag 600 
ccaggagcaa gcgccagagc acaagcagga gcaaggagtg gagcacaggc aggagccgac 660 
acaagaacac aagcaggaag aggggcagaa acaggaagag caagaagagg aacaggaaga 720 
ggagggaaag caggaagaag gacaggggac taaggaggga cgggaggctg tgtctcagct 780 
gcagacagac tcagagccca agtttcactc tgaatctcta tcttctaacc cttcctcttt 840 
tgctccccgg gtacgagaag tagagtctac tcctatgata atggagaaca tccaggagct 900 
cattcgatca gcccaggaaa tagatgaaat gaatgaaata tatgatgaga actcctactg 960 
gagaaaccaa aaccctggca gcctcctgca gctgccccac acagagcctt gctggtgctg 1020 
tgctattcga tcgtggagaa tacctgcatc ataaccccca cagccaaggc ctggaagtac 1080 
atggaggagg agatccttgg tttcgggaag tcggtctgtg acagccttgg gcggcgacac 1140 
atgtctacct gtgccctctg tgacttctgc tccttgaagc tggagcagtg ccactcagag 1200 
gccagcctgc agcggcaaca atgcgacacc tcccacaaga ctccctttgc agccccttgc 1260 
ttgcctccca gagcctgtcc atcggcaacc aggtagggtc cccagaatca ggccgctttt 1320 
acgggctgga tttgtacggt gggctccaca tggacttctg gtgtgcccgg cttgccacga 1380 
aaggctgtga agatgtccga gtctctgggt ggctccagac tgagttcctt agcttccagg 1440 
atggggattc cctaccaaga tttgtgacac agactatatc cagtacccaa actactgttc 1500 
cttcaaaagc cagcagtgtc tgatgagaaa ccgcaatcgg aaggtgtccc ^atgagatg 1560 
tctgcagaat gagacttaca gtgcgctgag ccctggcaaa agtgaggacg ttgtgcttcg 1620 
atggagccag gagttcagca ccttgactct aggccagttc ggatgagctg gcgtctattc 1680 
tgcccacacc ccagcccaac ctgcccacgt tctctattgt tttgagaccc cattgctttc 1740 
aggctgcccc ttctgggtct gttactcggc ccctactcac atttccttgg gttggagcaa 1800 
cagtcccaga gagggccatg gtgggagtgc gccctcctta aaagatgact ttacataaaa 1860 
tgttgatctt caaaaaaaaa aaaaaaaaaa aa 1892 



<210> 91 

<211> 523 

<212> DNA 

<213> Homo sapiens 



<400> 91 

cacagcaaag caagttctaa gagccaagct tcagaccaat cccccaccgt jaagtccccc 60 
gtgcgagtgc cccttgaagg aggttttcta acaggtgagt ggtctgattc tgtctctgtc 120 
ctgtgggatg gatgggctgg cacttgatgg ctctccttcc ccctcacccy ^cacggagaa 180 
ggctggaagg tgcatttctc agacttcctt gcctgggaaa tgggaagtga tgcagaggat 240 
cccaacgtct cctcggcagg cttggtggtg gacgtgctgg gccatgttcc aggggccagc 300 
tgctggctcc gtgggtgctg agaggaaggg ggaaggctgt ctattttttg gccaggatga 360 
atccagcaga tgtggtaggt cctggccgct tgctgacccg tgggtctacc SWtflctccg 420 
gagctaatgg tccccagatg ctccaccgtc ctgatgtggc agaggcatgg cattttggcg 480 
ggccagtttg gtggcatcct gggaaccgtt ttggaggctc gag 523 
<210> 92 

<211> 1382 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1382) 

<223> n equals a,t,g, or c 



<400> 92 

gccggctggc agcacgactc gcgtagccgt gcgccgattg cctctcggcc tgggcaatgg 60 

tcccggctgc cggtcgacga ccgccccgcg tcatgcggct cctcggctgg tggcaagtat 120 

tgctgtgggt gctgggactt cccgtccgcg gcgtggaggt tgcagaggaa agtggtcgct 180 

tatggtcaga ggagcagcct gctcaccctc tccaggtggg ggctgtgtac ctgggtgagg 240 

aggagctcct gcatgacccg atgggccagg acagggcagc agaagaggcc aatgcggtgc 300 

tggggctgga cacccaaggc gatcacatgg tgatgctgtc tgtgattcct ggggaagctg 360 

aggacaaagt gagttcagag cctagcggcg tcacctgtgg tgctggagga gcggaggact 420 

caaggtgcaa cgtccgagag agccttttct ctctggatgg cgctggagca cacttccctg 480 

acagagaaga ggagtattac acagagccag aagtggcgga atctgacgca gccccgacag 540 

aggactccaa taacactgaa agtctgaaat ccccaaaggt gaactgtgag gagagaaaca 600 

ttacaggatt agaaaatttc actctgaaaa ttttaaatat gtcacaggac cttatggatt 660 

ttctgaaccc aaacggtagt gactgtactc tagtcctgtt ttacaccccg tggtgccgct 720 

tttctgccag tttggcccct cactttaact ctctgccccg ggcatttcca gctcttcact 780 

ttttggcact ggatgcatct cagcacagca gcctttctac caggtttggc accgtagctg 840 

ttcctaatat tttattattt caaggagcta aaccaatggc cagatttaat catacagatc 900 

gaacactgga aacactgaaa atcttcattt ttaatcagac aggtatagaa gccaagaaga 960 

atgtggtggt aactcaagcc gaccaaatag gccctcttcc cagcactttg ataaaaagtg 1020 

tggactggtt gcttgtattt tccttattct ttttaattag ttttattatg tatgctacca 1080 

ttcgaactga gagtattcgg tggctaattc caggacaaga gcaggaacat gtggagtagt 1140 

gatggtctga aagaagttgg aaagaggaac ttcaatcctt cgtttcagaa attagtgcta 1200 

cagtttcata cattttctcc agtgacgtgt tgacttgaaa cttcaggcag attaaaagaa 1260 

tcatttgttg aacaactgaa tgtataaaaa aattataaac tggtgtttta actagtattg 1320 

caataagcaa atgcaaaaat attcaataga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

an 1382 



<210> 93 

<211> 1747 

<212> DNA 

<213> Homo sapiens 



<400> 93 

ccacgcgtcc ggctacctgt gcatcgtgct gctcatgctg ctgctgctca tcttctggat 60 

cgcgccggcc catgggccca ccaacatcat ggtctacatc agcatctgct ccttgctggg 120 

cagtttcacc gtgccttcca ccaagggcat cgggctggcg gcccaagaca tcttgcataa 180 

caacccgtcc agtcagagag ccctctgcct gtgcctggta ctcctggccg tgctcggctg 240 

cagcatcatc gtccagttca ggtacatcaa caaggcgctg gagtgcttcg actcctcggt 300 

gttcggggcc atctactacg tcgtgtttac cacgctggtc ctgctggcct cagccatcct 360 

cttccgggag tggagcaacg tgggcctggt ggacttcttg gggatggcct gtggattcac 420 

gaccgtctcc gtggggattg tccttataca ggtgttcaaa gagttcaatt tcaaccttgg 480 

ggagatgaac aaatctaata tgaaaacaga ctagattgca ataggagctt ggatggttcg 540 

aggaataggc attggaggtg gtttctggcc gtgattggat gtgaagtaga agaggtcctc 600 

gatcatggtg ttagaattga ctggatagta acaggtggtc tggtggatag cggggagcat 660 

ggctcagcac cagagcagag gcccagcagc ctctgcagcc caaacgtccc aacggtgcct 720 
ggaccatctc ttctgatgag acgaatctca ttttcatttc cattaacctg gaagctttca 780 

tgaatatttc ttctttaaaa cattttaaca ttatttaaac agaaaaagat gggctctttc 840 

tgggtaggtg gtacatgata gcagagatat ttttacttag attactttgg gaatgagaga 900 

ttgtgtcttg aactctgcac tgtacaggat gtgtctgtag ttgtgttagt ttgcattaag 960 

catgtataca ttcaagtatg tcatccaaat aagaggcata tcattgaatt gtttttaatc 1020 

ctctgacaag ttgactcttc gacccccacc cccacccaag acattttaat agtaaataga 1080 

gagagagaga agagttaatg aacatgaggt agtgttccac tggcaggatg acttttcaat 1140 

agctcaaatc aatttcagtg cctttatcac ttgaattatt aacttaattt gactcttaat 1200 

gtgtatatgt tcttagatta gaataatgca acttcgagta tgctttaata tttcaatatt 1260 

caagttacaa atgtataagg cagttagaaa taatacagtc acatgtcact taatgatagg 1320 

gaaacattct gagaaatgca ttgtaaggtg actttattgt gtgaacatca tggagtgcac 1380 

ttatacaaac ctagatggga cacctatgac ccacccaggc cagatggtac agcctgttgc 1440 

tcctgggcca cacacctgta cagcatgtga ctgcactgaa taccgcaggc aattgtaaca 1500 

cagtggtgag tatttgtgtt tacaaacata ggaaaggtac agtaaaacta tggtattaca 1560 

atgttatggg accaccgtca tgtaagtggt atgtctttga cagaaacatg gttacgtggt 1620 

tcatgactgt atattcactg gaagatagtc aagactaaag acacattaga gcaaattgac 1680 

ccctttaaca tgtgattatt gtccaattaa agacagttga tttaagtagc aaaaaaaaaa 1740 

aaaaaaa 1747 



<210> 94 

<211> 600 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (553) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (589) 

<223> n equals a,t,g, or c 



<400> 94 

gaattcggca cgagcggcac gagccgagat cgttctgggg ctgctggtat ggacgcttat 60 
tgctggaact gagtacttcc gggtccccgc atttggctgg gtcatgtttg tagctgtatt 120 
ttactgggtc ctcaccgtst tcttcctcat tatctacata acaatgacct acaccaggat 180 
tccccaggtg ccctggacaa cagtgggcct gtgctttaac ggcagtgcct tcgtcttgta 240 
cctctctgcc gctgttgtag atgcatcttc cgtctcccct gagaaggaca gtcacaactt 300 
caacagctgg gcggcctcat cgttctttgc cttcctggtc accatctgct acgctggaaa 360 
tacatatttc agttttawag catggagawc caggaccata cagtgattta ccattttgat 420 
aattaaaagg aaaaaaaaag gaagactctc actgtaaaaa cagctgtagg tataatgtat 480 
attcccagag aattgtattt aactaattaa tgttttttat attcttaaat ttgctcacaa 540 
attgtggttt gtnacaattn aactgggtta ctttatttgg caagtgttnt aggcttttaa 600 



<210> 95 
<211> 586 
<212> DNA 
<213> Homo sapiens 



<400> 95 



ggcacgaggt tttttccttt ataacggaag ttttataatt catcttttat gtaagtgtaa 60 
ttctcattaa aaatacccta aagcttaaag tttgcaaggc tgcccagcct aacccacaac 120 
agtttgatgc tgccccctag cgtttgattc ccttcacctt ttgctaaaat aaggtaatgt 180 
ttaaattaca attagattta cttactgctg taaatctggt ctattttagt ttcctctggg 240 
tagttagtgt tgctaataag atggacgtaa gtgtttttga actggtgaat tctgattgct 300 
tttagccccc agttttccaa ataggggtga attttgggta gagatagaac aatcaccaag 360 
ttaccttgct ccaaaaaaga aatttacgta tgggattgtt ttcaaagcgg gaagttagct 420 
gtgtaaataa caacaatttt atatatttaa tctgggcttc tccttatctt gaatgatata 480 
aaaatctact ttctagatta atttagttcc atataacttt gtattgcttt gactgtactg 540 
ataataaagt ttgaaagtgt taaaaaaaaa aaaaaaaaaa aaaaaa 586 



<210> 96 
<211> 802 
<212> DNA 

<213> Homo sapiens 
<400> 96 

ggcacgagcc ctcctccctg ctcgccccca gattcccctc ccctccctgg tgcttttgtc 60 

tggagggtgt tatgggtttg tgtgtgtatg agcgtgtgtg tgtttttgga tttcagacta 120 

attttctgga gtttctgccc ctgctctgcg tcaccctcac gtcacttcgc cagcagtagc 180 

agaggcggcg gcggcggctc ccggaattgg gttggagcag gagcctcgct ggctgcttcg 240 

ctcgcgctct acgcgctcag tccccggcgg tagcaggagc ctggacccag gcgccgccgg 300 

cgggcgtgag gcgccggagc ccgggtgagc agcgcagata gtgccctcgg tcgcctcggc 360 

cctcactgtc tccccctggg gcggcctcgg ctactcccca ggtgggacgt gccgcgccac 420 

ctgcccgcgc caccggcacc cagcggccgt ggcggattct gcagcatcat tcgggggccc 480 

cgtcgcggag ccaaagccgc cggcagtctc cgcattcccc tttaaagggt ccttcgcccg 540 

gcctgtacca tggaatcctg tcttggggac cctttcccta cctcccctcc cttggcctca 600 

ggctcgaaga gagagtgggc acactggtgg ctccagcggc gtcagtgcca tcgcggggca 660 

agttgattcc tgggcactca tccatccaca gtctccgggc tggggtcggg gtggggatga 720 

cgcgagcaga gagggagagt gccccaatta gtggtgttgg gggtcctacg ctcagtctta 780 

cgcgtgtctg tttgtcctca gc 802 



<210> 97 

<211> 1226 

<212> DNA 

<213> Homo sapiens 



<400> 97 



ggcacgagca tgctttgctt acaatggagt ctgcagtgag gggagatgct gggatagcca 60

tttccatggc tctgttatgc aagcacaaat ttcatctcct agatggactt cctggttttc 120

tcttactgca gtaacactgg ccttcccttc tctaattcct taccccagct gcggcatccc 180

tgtgttaact caggatgcca agtggccctc agattacact tctccagata gctgaatgag 240

tctgctttca ctgtgactgg gacctgaatg acctgcagtc agggcccaga gttgggactc 300

tatactaccc tgggctctgg tctgtaggtt tgtagtagcc accggtaata agccaagggc 360

taggctcttg tttgagttta tggccacctg gaattttcag tcatctcatg atacaggcgg 420

gaggggcaga acagatagat tacgacaggt ttggttttta aattttccaa ccaagtggaa 480

aggcaagttg gtcttataga aagcactact gcacttagta gctatgtgat tttgagcaaa 540

ccacataatc tctctaggtc cattttccta accacaagat aaagatgtta cattgtcaaa 600

gcttgccgta gatttggggt gaatgaaaat tattccttgc tttcatcact acctttatag 660

ctctcatcac tacctttata gctcatcact gtgccttttt ttctttccta agaaagacat 720

cacatccctc tcctctcctc ctctgtgctc ctgtccctcc ctccccctag caaggtccag 780 

gcaaagctgg agatgaagct gaagatccag agtttcctag aacgcaactt aaggatggct 840 

aaggaaaggg aagcctgact gctcggtcag gagggtgcag tatctcttgc tgggaacaca 900 

gccagtttcc acaatgccta gactgtgtat gtctatttgc acaagattgg cttttcctat 960 

tttggagtgg tcagacattt tatttttgtt caagattatc tggcgtttta gacaaatttg 1020 

caaaactgtg cttttattga ctttttgaat aaactttggt attctggagc aaatgtattt 1080 

atttattggt atgtgcaatg acaaacttgg tatttttccc atgtttgaca tttatgttat 1140 

gtttgttaga attttagtgt ttgtctaagt acacacatat atcaacaaat taaacttgaa 1200 

tcgtttcaaa aaaaaaaaaa aaaaaa 1226 



<210> 98 

<211> 1120 

<212> DNA 

<213> Homo sapiens 



<400> 98 

aggggactct caccctctcc cagcaatgtc taaagtcagg catctgaaaa ccagcagtaa 60 

tcctgcctct gaagtttatc aggaaaggag cttaaaagag aaccaaattc agcctgtgtt 120

ggaactctca gtcccagagg ggtgtggttt atagctctcc ggcctgctgt tggacttagg 180 

ctgtgaccca cagaaggacg ccagaaagta ctcaagacat tcacggtgcc ccggtcagca 240 

ctcgccatga cgaagacttc tacatgcata taccacttcc ttgttctgag ctggtatact 300 

ttcctcaatt attacatctc acaggaagga aaagacgagg tgaaacccaa aatcttggca 360 

aatggtgcaa ggtggaaata tatgacgctg cttaatctgc tcttgcagac cattttctac 420 

ggggtcacct gcctggatga tgtgctgaaa agaaccaaag ggggaaaaga cattaagttc 480 

ctaactgcct tcagagacct gcttttcacc actctggctt ttcctgtatc cacgtttgta 540 

tttttggcat tctggatcct ctttctctac aatcgagatc tcatttaccc caaggtccta 600 

gatactgtca tccccgtgtg gctgaatcat gcaatgcaca ctttcatatt ccccatcaca 660 

ttggctgaag tcgtcctcag gcctcactcc tatccatcaa agaagacagg actcaccttg 720 

stggctgctg ccagcattgc ttacatcagc cgcatcctat ggctctactt tgagacgggt 780 

acctgggtgt atcctgtgtt tgccaaactc agcctcttgg gtctagcagc tttcttctct 840 

ctcagctacg tcttcatcgc cagcatctac ctacttggag agaagctcaa ccactggaaa 900 

tggggtgaca tgaggcagcc acggaagaag aggaagtaat tgcacaccat tttccaagaa 960 

ccaagaaaga agaaaacaca agagattttt ctcatctttt tttttttttt tctggtggag 1020 

ggaggtggtg gaggaacata gcaaagtagg agggacagag agtgatactt aaatttaata 1080 

agaggttcgt gaaggtaaaa aaaaaaaaaa aaaactcgag 1120 



<210> 99 

<211> 2596 

<212> DNA 

<213> Homo sapiens 



<400> 99 

ccacgcgtcc gacttggcaa gcgttcacaa ccaaaatggc cagctctttc tggaagatat 60 

tgtaaaacgt gatggatttc cactatgggt tgggctctca agtcatgatg gaagtgaatc 120 

aagttttgaa tggtctgatg gtagtacatt tgactatatc ccatggaaag gccaaacatc 180 

tcctggaaat tgtgttctct tggatccaaa aggaacttgg aaacatgaaa aatgcaactc 240 

tgttaaggat ggtgctattt gttataaacc tacaaaatct aaaaagctgt cccgtcttac 300 

atattcatca agatgtccag cagcaaaaga gaatgggtca cggtggatcc agtacaaggg 360 

tcactgttac aagtctgatc aggcattgca cagtttttca gaggccaaaa aattgtgttc 420 

aaaacatgat cactctgcaa ctatcgtttc cataaaagat gaagatgaga ataaatttgt 480 

gagcagactg atgagggaaa ataataacat taccatgaga gtttggcttg gattatctca 540 

acattctgtt gaccagtctt ggagttggtt agatggatca gaagtgacat ttgtcaaatg 600 
ggaaaataaa agtaagagtg gtgttggaag atgtagcatg ttgatagctt caaatgaaac 660

ttggaaaaaa gttgaatgtg aacatggttt tggaagagtt gtctgcaaag tgcctctggg 720 

ccctgattac acagcaatag ctatcatagt tgccacacta agtatcttag ttctcatggg 780 

cggactgatt tggttcctct tccaaaggca ccgtttgcac ctggcgggtt tctcatcagt 840 

tcgatatgca caaggagtga atgaagatga gattatgctt ccttctttcc atgactaaat 900 

tcttctaaaa gttttctaat ttgcactaat gtgttatgag aaattagtca cttaaaatgt 960 

cccagtgtca gtatttactc tgctccaaag tagaactctt aaatactttt tcagttgttt 1020 

agatcttagg catgtgctgg tatccacagt taattccctg ctaaatgcca tgtttatcac 1080 

cctaattaat agaatggagg ggactccaaa gctggaactg aagtccaaat tgtttgtaca 1140 

gtaatatgtt taatgttcat tttctctgta tgaatgtgat tggtaactag atatgtatat 1200 

tttaatagaa tttttaacaa aacttcttag aaaattaaaa taggcatatt actaggtgac 1260 

atgtctactt tttaattttt aagagcatcc ggccaaatgc aaaattagta cctcaaagta 1320 

aaaattgaac tgtaaactct atcagcattg tttcaaaata gtcattttta gcactgggga 1380 

aaaataaaca ataagacatg cttacttttt aatttttatt tttttgagac tgagtctctc 1440 

tctgttgccc aggctggagt acaatggcgt gatctcggct cactgcaaat ctccgcctcc 1500 

caggttcaag cgattctcct gcctcagcct cctgagtagc tgggattaca ggcaactgcc 1560 

accatgcccg gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg 1620 

ctggtctcga actcgtgacc gcaggtgatc ctcccgcctc ggcctcccaa agtgctggga 1680 

ttacaggcat gagccaccgc gcctggcctc tgcttacttt ttatatagca aaatgattcc 1740 

tcttggcaag atgtttctta tattattcca aagttatttc ataccattat tatgtaaata 1800 

tgaagagttt ttttctgttt ataattgttt ataaaacaat gacttttaaa gatttagtgc 1860 

ttaacatttt cccaagtgtg ggaacattat ttttagattg agtaggtacc ttgtagcagt 1920 

gtgctttgca ttttctgatg tattacatga ctgtttcttt tgtaaagaga atcaactagg 1980 

tatttaagac tgataatttt acaatttata tgcttcacat agcatgtcaa cttttgacta 2040 

agaattttgt ttactttttt aacatgtgtt aaacagagaa agggtccatg aaggaaagtg 2100 

tatgagttgc atttgaaaaa tgagactttt tcagtggaac tctaaacctt gtgatgacta 2160 

ctaacaaatg taaaattatg agtgattaag aaaacattgc tttgtggtta tcactttaag 2220 

ttttgacacc tagattatag tcttagtaat agcatccact ggaaaaggtg aaaatgtttt 2280 

attcagcatt taacttacat ttgtacttta gagtattttt gtataaaatc catagattta 2340 

ttttacattt agagtattta cactatgata aagttgtaaa taattttcta agacagtttt 2400 

tatatagtct acagttgtcc tgatttctta ttgaatttgt tagactagtt ctcttgtctt 2460 

gtgatctgtg tacaatttta gtcactaaga ctttcctcca agaactaagc caacttgatg 2520 

tgaaaagcac agctgtatat aatggtgatg tcataataaa gttgttttat cttttaagta 2580 

aaaaaaaaaa aaaaaa 2596 



<210> 100 

<211> 1020 

<212> DNA 

<213> Homo sapiens 



<400> 100 

aaactagggg aaaatgtagc caacatatac aaagatcttc agaaactctc tcgcctcttt 60 

aaagaccagc tggtgtatcc tcttctggct tttacccgac aagcactgaa cctaccagat 120 

gtatttgggt tggtcgtcct cccattggaa ctgaaactac ggatcttccg acttctggat 180 

gttcgttccg tcttgtcttt gtctgcggtt tgtcgtgacc tctttactgc ttcaaatgac 240 

ccactcctgt ggaggttttt atatctgcgt gattttcgag acaatactgt cagagttcaa 300 

gacacagatt ggaagactgt acaggaagag gcacatacaa agaaaagaat ccccgaaagg 360

gcggtttgtg atgctcctgc catcgtcaac tcacaccatt ccattctatc ccaacccctt 420 

gcaccctagg ccatttccta gctcccgcct tcctccagga attatcgggg gtgaatatga 480 

ccaaagacca acacttccct atgttggaga cccaatcagt tcactcattc ctggtcctgg 540 

ggagacgccc agccagtttc ctccactgag accacgcttt gatccagttg gcccacttcc 600 

aggacctaac cccatcttgc cagggcgagg cggccccaat gacagatttc cctttagacc 660 

cagcaggggt cggccaactg atggccggct gtcattcatg tgattgattt gtaatttcat 720

tztctggagct ccatttgttt ttgtttctaa actacagatg tcaactcctt ggggtgctga 780 
tctcgagtgt tattttctga ttgtggtgtt gagagttgca ctcccagaaa ccttttaaga 840

gatacattta tagccctagg ggtggtatga cccaaaggtt cctctgtgac aaggttggcc 900 

ttgggaatag ttggctgcca atctccctgc tcttggttct cctctagatt gaagtttgtt 960 

ttctgatgct gttcttacca gattaaaaaa aagtgtaaat taaaaaaaaa aaaaaaaaaa 1020 



<210> 101 

<211> 1520 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (71) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE
<222> (473)

<223> n equals a,t,g, or c



<400> 101 

gcttttttct taagtgcaca aagcatcata ctccctggag gcaaacacat cgggctgctt 60 

cagcgttacg ngatgcttag cattttgaat attgtggcaa aaaaattaaa agttcactta 120 

ttaatattta tcagcagtat cataatttcc atcctcttat ttcagaattt cacttgaggc 180 

aaaaatacca caagtgtaat tactctagca cagctattaa tgtgctggat gataggccac 240 

tgcgtcacat gaccttctat tgttcatggg tttaaagaga aagcagggct ttgtatttct 300 

ttttcttctt ttaaagtcga ctgtagcatc ttggcttttg tctggggtgg ggaggatctg 360 

gggtctggtt cactttgtaa aagtaaacca tgtctgttta aacaatagag gtgtttaaga 420 

agactcttta gttttcctgc agattgttca agattacatg ataatcacac ggngtattta 480 

tttcctactg acaaaccaag tacttgttac atcaccaatg gtaccaggag atgaagacgc 540 

gggttttgag caggagcgag attaccaccc aaaaagggag ctacctgagg cagcccagct 600 

tctagcaaac tttttacatg ttgcacattt cagttcttaa atgaaggcta ctccagtgtc 660 

atttcattaa agtacctggg tgtagtactc aagtcccccc tcaagagttc ataagtaagc 720 

agtatccttt tggccagtgg tcctgttttt gcccctaccc agactgttcg rgaagcatat 780 
tctatagata aatctgacat ttgtcatcca ataccattgc agtcctctgc agcatacatt 840

ctcaatgggg gctgtatcac ccctagattg gttctgagat actgcaatgt cttgtgtcct 900

tccaaaggac cataatactt gagcaaatat gaacatttct tggggtgagg gcagaaagag 960

agaaacaaaa gtctaaaaag ggacaataat gaaaaaacag ttgagacctt tagtatgatg 1020

ggaacaggat gaggaaggag gagatactga caggagccct gggtcttgct ctgcattaaa 1080

cagatattta tggacattaa acagatattt atggagcacg actctgtacc ctacaggccc 1140

agaatagtct taaggctcct gggaattgat gataggccat ttacccagtt tcagtttaga 1200

ggcagattca ctggccttag catttcagta attatattta tttattttta gcctgaacca 1260

gatttaatag gagaaactac tttctgcgtt tcttttaatt acttgtagtt tacacagtaa 1320

ctttagaaga gtaaatgaaa gcatgcttcg atgctgccac tgtaaatacc attcattagt 1380

aacttatttt ccctggagtc ttgtgaagtg tgaatttaaa gcctgctcta tctggaatat 1440

ggaatagtat taagattaca agcacatttt atattcatga gccggaaagg caaaaaaaaa 1500

aaaaaaaaat gaccctcgag 1520



<210> 102 
<211> 1306 
<212> DNA 

<213> Homo sapiens 
<220>

<221> SITE 
<222> (1300) 

<223> n equals a,t,g, or c 



<400> 102 

aattcccggg tcgacccacg cgtccggaat ttaagggacc cacactacct tcccgaagtt 60 

gaaggcaagc ggtgattgtt tgtagacggc gctttgtcat gggacctgtg cggttgggaa 120 

tattgctttt cctttttttg gccgtgcacg aggcttgggc tgggatgttg aaggaggagg 180 

acgatgacac agaacgcttg cccagcaaat gcgaagtgtg taagctgctg agcacagagc 240 

tacaggcgga actgagtcgc accggtcgat ctcgagaggt gctggagctg gggcaggtgc 300 

tggatacagg caagaggaag agacacgtgc cttacagcgt ttcagagaca aggctggaag 360

aggccttaga gaatttatgt gagcggatcc tggactatag tgttcacgct gagcgcaagg 420 

gctcactgag atatgccaag ggtcagagtc agaccatggc aacactgaaa ggcctagtgc 480 

agaagggggt gaaggtggat ctggggatcc ctctggagct ttgggatgag cccagcgtgg 540 

aggtcacata cctcaagaag cagtgtgaga ccatgttgga rgargaggar gaagaggagg 600

aagaggaagg gggagacaag atgaccaaga caggaagcca ccccaaactt gaccgagaag 660 

atctttgacc cttgcctttg agcccccagg aggggaaggg atcatggaga gccctctaaa 720 

gcctgcactc tccctgctcc acagctttca gggtgtgttt atgagtgact ccacccaagc 780 

ttgtagctgt tctctcccat ctaacctcag gcaagatcct ggtgaaacag catgacatgg 840 

cttctggggt ggagggtggg ggtggaggtc ctgctcctag agatgaactc tatccagccc 900 

cttaattggc aggtgtatgt gctgacagta ctgaaagctt tcctctttaa ctgatcccac 960 

ccccacccaa aagtcagcag tggcactgga gctgtgggct ttggggaagt cacttagctc 1020 

cttaaggtct gtttttagac ccttccaagg aagaggccag aacggacatt ctctgcgatc 1080 

tatatacatt gcctgtatcc aggaggctac acaccagcaa accgtgaagg agaatgggac 1140 

actgggtcat ggcctggagt tgctgataat ttaggtggga tagatacttg gtctacttaa 1200 

gctcaatgta acccagagcc caccatatag ttttataggt gctcaatttt ctatatcgct 1260 

attaaacttt tttctttttt tctaaaaaaa aaaaaaaaan actcga 1306 



<210> 103 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<400> 103 

cttttagaag gtacgcctgc aggtaccggt ccggaattcc cgggtcgacc cacgcgtccg 60 

ggaaatgaac taccatttat aacttctgtt tttttattga gaaaatgatt cacgaattcc 120 

aaatcagatt gccaggaaga aataggacgt gacggtactg ggccctgtga ttctcccagc 180 

ccttgcagtc cgctaggtga gaggaaaagc tctttacttc cgcccctggc agggacttct 240 

gggttatggg agaaaccaga gatgggaatg aggaaaatat gaactacagc agaagcccct 300

gggcagctgt gatggagccc ctgacattac tcttcttgca tctgtcctgc cttctttccc 360 

tctgcgaggc agtggggtgg gattcagagt gcttagtctg ctcactggga gaagaagagt 420

tcctgcgcat gcaagccctg ctgtgtggct gtcgtttaca tttgggaggt gtcctgtatg 480 

tctgtacgtt ggggactgcc tgtatttgga agatttaaaa acctagcatc ctgttctcac 540 

cctctaagct gcattgagaa atgactcgtc tctgtatttg tattaagcct taacactttt 600 

cttaagtgca ttcggtgcca acatttttta gagctgtacc aaaacaaaaa gcctgtactc 660 

acatcacaat gtcattttga taggagcgtt ttgttatttt tacaaggcag aatggggtgt 720 

aacagttgaa ttaaacttag caatcacgtg ctcaaaaaaa aaaaaaaaaa aaaaagggcg 780

gccgc 785 



<210> 104 
<211> 2015 
<212> DNA 
<213> Homo sapiens



<220> 

<221> SITE 
<222> (3) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1981) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1990) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2001) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2002) 

<223> n equals a,t,g, or c 



<400> 104 

ccnggaatnc cgggtcgacc cacgcgtccg gcctgcgctg ccagcagcca ggagccagga 60 
gccaagagca gagcgccagc atgaacttgg gggtcagcat gctgaggatc ctcttcctcc 120 
tggatgtagg aggagctcaa gtgctggcaa caggcaagac ccctggggct gaaattgatt 180
tcaagtacgc cctcatcggg actgctgtgg gtgtcgccat atctgctggc ttcctggccc 240
tgaagatctg catgatcagg aggcacttat ttgacgacga ctcttccgac ctgaaaagca 300
crcctggggg cctcagtgac accatcccgc taaagaagag agccccaagg cgaaaccaca 360
atttctccaa aagagatgca caggtgattg agctgtaggt gagcagtgac gtgaagaggg 420
gttctagccc cgtggaaaac agcccatggt taacatctca ggatgtcctg cattcaaaca 480
cccaaggctg gtaatgaact ttcacatgga ctgaatattg gaggcaaata atagaaggaa 540
tagaatatac agtgcctctg tcctgaagga aaatatcatg cctcttctgg aagaaacgga 600
ctgcacagag gaaggattga gcaatttagc ctgcagtgga agaaggtgga caccaaaagc 660
ttcaccctgt gttggagctg ttcatgcttc catgaggcca tggtgtccat gtccgtggaa 720
cctaccacag aaaatggctc atgaaaaggg gaatccgacc caacacacag cttcctacac 780
actgccatct tatcaacagt taggcactac tttgtagaac gattagcttc accctcttag 840
ctgccaggag atcccttctt aaagatggac tatgtgaaga ttcgggagtc ctgaaacatg 900
gggactccgg gatggtctct agccctatcg atgatgaaca ctggccttct ggaggggaaa 960

tggcagtctg ggctggcgtg gtaggaaggg ctttggtgtt catggaatgg gcctgctgct 1020

ctcagacctt caaaggatgg aaccaacgaa ggaccaaatg agaaagcaga tgcttgcctt 1080 

gcagagggcc atgaatgtca gttattattt ttctccttat acaattattt tgtggttatt 1140 

attacaatgt acatggctgt tgcatagaag acatgactgg tggaggctga ggaaagccat 1200 

gacattctac aattgccatc aggctaaggc cccgtgagca tttctctccc ttgtaatatt 1260

aaccctgtat ttctgggatc acatcacgga atattctttg cctttccact ttccaggaaa 1320 
tctctcggac tgggctaccc tccttgtgtg tgatgaaaga tgagctatat ttcagaacaa 1380

agtgctgtgt tgtcatratt tgcctggact cccagggcgt ctcttaccca acttgataac 1440 

gatgctgttc attagcagcc tttgttaact gataaccaag agcggtaatg tgatactcat 1500 

aagcaatttt ctgtgtgtag gataaaataa accatcttgt atgggaaaaa aaaaaaaaaa 1560 

aaaaaaaaaa aaaaagggcg gccgctctag aggatccaag cttacgtacg cgtgcatgcg 1620 

acgtcatagc tcttctatag tgkcacctaa attcaattca ctggccgtcg ttttacaacg 1680 

tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt 1740 

cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag 1800 

cctgaatggc gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt 1860

tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt 1920 

cccttccttt ctcgccacgt tcgccgggtt tccccgtcaa gctttaaatc gggggcttcc 1980 

nttaagggtn ccaattaagg nnttaccggg acctt 2015 



<210> 105 
<211> 367 
<212> DNA 

<213> Homo sapiens 
<400> 105 

cggcacgagt gtaaatgtca ccaccaaagg tttgcaccct gatcaaaaag agtatgaaaa 60 
gaataatacc acaacactta tggcctgtct tggaggcctt ctggggatta ttggtgtgat 120 
atgtcttatc agctgcctct ctccagaaat gaactgtgat ggtggacaca gctatgtgag 180 
gaattactta cagaaaccaa cctttgcatt aggtgagctt tatcctcctc tgataaatct 240 
ctgggaagca ggaaaagaaa aaagtacatc actgaaagta aaagcaactg ttataggttt 300 
accaacaaat atgtcctaaa aaccaccaag gaaacctact ccaaaaatga aaaaaaaaaa 360 
aaaaaaa 367



<210> 106 
<211> 1889 
<212> DNA 

<213> Homo sapiens 



<400> 106 

ctcatccttc tatcatcata tggagtggca ataatgaaaa tgaggaggcg ctgatgatga 60 

attggtatca tatcagtttc actgaccggc caatctacat caaggactat gtgacactct 120 

atgtgaaaaa catcagagag ctcgtactgg caggagacaa gagtcgtcct tttattacgt 180 

ccagtcctac aaatggggct gaaactgttg cagaagcctg ggtctctcaa aaccctaata 240 

gcaattattt tggtgatgta catttttatg actatatcag tgattgctgg aactggaaag 300 

ttttcccaaa agctcgattt gcatctgaat atggatatca gtcctggccg tccttcagta 360 

cattagaaaa ggtctcgtct acagaggact ggtctttcaa tagcaagttt tcacttcatc 420 

gacaacatca cgaaggtggt aacaaacaaa tgctttatca ggctggactt catttcaaac 480 

tcccccaaag cacagatcca ttacgcacat ttaaagatac catctacctt actcaggtga 540 

tgcaggccca gtgtgtcaaa acagaaactg aattctaccg ccgtagtcgc agcgagatag 600 

tggatcagca agggcacacg atgggggcac tttattggca gttgaatgac atctggcaag 660 

ctccttcctg ggcttctctt gatacggagg aaagtggaaa atgcttcatt actttgctca 720 

gaatttcttt gctccactgt tgccagtagc tttgaaatga aaacatgttc tatatctatg 780 

gtgtgtcaga tcttcactcg gattattcga tgacactcag tgtgagagtc catacatgga 840 

gctccctgga gcccgtgtgc tctcgtgtga ctgaacgttt tgtgatgaaa ggaggagagg 900 

ctgtctgcct ttatgaggag ccagtgtctg aattgctgag gagatgtggg aattgcacac 960 

gggaaagctg tgtggtttcc ttttaccttt cagctgacca tgaactcctg agcccgacca 1020 

actaccactt cctgtcctca ccgaaggagg ccgtggggct ctgcaaggcg cagatcactg 1080 

ccatcatctc tcagcaaggt gacatatttg tttttgacct ggagacctca gctgtcgctc 1140 

cctttgtttg gttggatgta ggaagcatcc cagggagatt tagtgacaat ggtttcctca 1200

tgactgagaa gacacgaact atattatttt acccttggga gcccaccagc aagaatgagt 1260

tggagcaatc ttttcatgtg acctccttaa cagatattta ctgaaggaat ctaggttgta 1320 

ttttcagtgg acaatgggaa taaagcattt ctaaagcacc gactggagag gaaggcaaca 1380 

gagacaagga gagaagccga gagacatgtc tgcgtgctgc cacgcatctg agcgattgct 1440 

ctgtgaagag ttgtacactg aacattttca ggggaggctg tttacccagg caatgtcctc 1500 

aaacaagcct gtgccggggt gtcctggaat ctgtgccagg actgtgtttt tagcccttca 1560 

cctctcagct ttagcaggac atgaaccagt tataacaaga tggccctgca gctggttaca 1620 

agaatgtgac atggcaggat ctatggaacc aaatggaagg ttttgaggtg atgtaggtct 1680

ttcacagtta gctttgggga atacagaata ctcaaataaa gtgctttgtt attatttcag 1740 

agggaatggc gattgaaatg ttacaacaga gatttcttgg tggtagctat ttgggtaaag 1800 

gtatatggat atttttctgt acatgtgaaa ttatataaaa ataaaagtta tataaattac 1860 

attgaaaaaa aaaaaaaaaa aaaaaaaaa 1889 



<210> 107 

<211> 1201 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (1086) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1161) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (1176) 

<223> n equals a,t,g, or c 



<400> 107 

cggcacgagc ggctggcagc acgactcgcg taccgtgcgc cgattgcctc tcggcctggg 60 

caatggtccc ggctgccggt cgacgaccgc cccgcgtcat gcggctcctc ggctggtggc 120 

aagtattgct gtgggtgctg ggacttcccg tccgcggcgt ggagggacct tatggatttt 180 

ctgaacccaa acggtagtga ctgtactcta gtcctgtttt acaccccgtg gtgccgcttt 240 

tctgccagtt tggcccctca ctttaactct ctgccccggg catttccagc tcttcacttt 300 

ttggcactgg atgcatctca gcacagcagc ctttctacca ggtttggcac cgtagctgtt 360 

cctaatattt tattatttca aggagctaaa ccaatggcca gatttaatca tacagatcga 420

acactggaaa cactgaaaat cttcattttt aatcagacag gtatagaagc caagaagaat 480 

gtggtggtaa ctcaagccga ccaaataggc cctcttccca gcactttgat aaaaagtgtg 540 

gactggttgc ttgtattttc cttattcttt ttaattagtt ttattatgta tgctaccatt 600 

cgaactgaga gtattcggtg gctaattcca ggacaagagc aggaacatgt ggagtagtga 660 

tggtctgaaa gaagttggaa agaggaactt caatccttcg tttcagaaat tagtgctaca 720 

gtttcataca ttttctccag tgacgtgttg acttgaaact tcaggcagat taaaagaatc 780 

atttgttgaa caactgaatg tataaaaaaa ttataaactg gtgttttaac tagtattgca 840 

ataagcaaat gcaaaaatat tcaatagatg cactattctt gtttttactg catgraacgta 900 

atccagtatt tggkaaagta atccaktttg aaatgtgrag rtgtattccg gcagaatagt 960 

gagtagaatg acagcttact atacagaagg cmaaaatagg actctcaggt aatagtttaa 1020 

ggaaaccctt gattccttat gatgatgttt aagaaaggtt agttttctgt ttctttgcca 1080 

gttttncttc taggagtcca tagccaggga aagtatgtga accagaattg gttagtgtga 1140 

ccccctccaa gtagccagtg ntgggaaata agggtncaat accttgatgt ttgtgatctc 1200 
<210> 108

<211> 75 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (75) 

<223> Xaa equals stop translation 
<400> 108 

Met Asp Pro Leu Cys Leu Pro Ile Ile Leu Phe Ser Ala Val Val Leu
1 5 10 15

Arg Asn Leu Phe His Leu Leu Ile Leu Thr Phe His Tyr Leu Pro Leu
20 25 30

Phe Cys Asp Asn Pro Leu Ile Leu Glu Asp Leu Ser Cys Ile His Leu
35 40 45

Arg Val Asn Ile Phe Lys Ala Lys Gln Pro Lys Phe Tyr Gly Asn Gln
50 55 60

Leu Gln Pro Cys Val Met Lys Ser Ser Ala Xaa
65 70 75 



<210> 109 
<211> 202 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (202) 

<223> Xaa equals stop translation 
<400> 109 

Met Lys Leu Leu Ile Leu Phe Leu Ser His Leu Leu Ser Leu Ala Phe
1 5 10 15

Gly Ile Leu Cys Leu Ser Val Thr Val Ile Leu Ser Leu Leu Leu Ser
20 25 30

Phe Ser Lys Arg Gly Phe Ser Val Arg Ser Phe Gly Thr Gly Thr His
35 40 45

Val Lys Leu Pro Gly Pro Ala Pro Asp Lys Pro Asn Val Tyr Asp Phe
50 55 60

Lys Thr Thr Tyr Asp Gln Met Tyr Asn Asp Leu Leu Arg Lys Asp Lys
65 70 75 80

Glu Leu Tyr Thr Gln Asn Gly Ile Leu His Met Leu Asp Arg Asn Lys
85 90 95

Arg Ile Lys Pro Arg Pro Glu Arg Phe Gln Asn Cys Lys Asp Leu Phe
100 105 110

Asp Leu Ile Leu Thr Cys Glu Glu Arg Val Tyr Asp Gln Val Val Glu
115 120 125

Asp Leu Asn Ser Arg Glu Gln Glu Thr Cys Gln Pro Val His Val Val
130 135 140

Asn Val Asp Ile Gln Asp Asn His Glu Glu Ala Thr Leu Gly Ala Phe
145 150 155 160

Leu Ile Cys Glu Leu Cys Gln Cys Ile Gln His Thr Glu Asp Met Glu
165 170 175

Asn Glu Ile Asp Glu Leu Leu Gln Glu Phe Glu Glu Lys Ser Gly Arg
180 185 190 

Thr Phe Leu His Thr Val Cys Phe Tyr Xaa 
195 200 



<210> 110 
<211> 371 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (31) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (193) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 110 

Met Gly Leu Lys Leu Leu Gln Lys Pro Gly Ser Leu Lys Thr Leu Ile
1 5 10 15

Ala Ile Ile Leu Val Met Tyr Ile Phe Met Thr Ile Ser Val Xaa Cys
20 25 30

Trp Asn Trp Lys Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly
35 40 45

Tyr Gln Ser Trp Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr
50 55 60

Glu Asp Trp Ser Phe Asn Ser Lys Phe Ser Leu His Arg Gln His His
65 70 75 80

Glu Gly Gly Asn Lys Gln Met Leu Tyr Gln Ala Gly Leu His Phe Lys
85 90 95

Leu Pro Gln Ser Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr Ile Tyr
100 105 110

Leu Thr Gln Val Met Gln Ala Gln Cys Val Lys Thr Glu Thr Glu Phe
115 120 125

Tyr Arg Arg Ser Arg Ser Glu Ile Val Asp Gln Gln Gly His Thr Met
130 135 140

Gly Ala Leu Tyr Trp Gln Leu Asn Asp Ile Trp Gln Ala Pro Ser Trp
145 150 155 160

Ala Ser Leu Glu Tyr Gly Gly Lys Trp Lys Met Leu His Tyr Phe Ala
165 170 175

Gln Asn Phe Phe Ala Pro Leu Leu Pro Val Gly Phe Glu Asn Glu Asn
180 185 190

Xaa Phe Tyr Ile Tyr Gly Val Ser Asp Leu His Ser Asp Tyr Ser Met
195 200 205

Thr Leu Ser Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys
210 215 220

Ser Arg Val Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys
225 230 235 240

Leu Tyr Glu Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys
245 250 255

Thr Arg Glu Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu
260 265 270

Leu Leu Ser Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala
275 280 285

Val Gly Leu Cys Lys Ala Gln Ile Thr Ala Ile Ile Ser Gln Gln Gly
290 295 300

Asp Ile Phe Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val
305 310 315 320

Trp Leu Asp Val Gly Ser Ile Pro Gly Arg Phe Ser Asp Asn Gly Phe
325 330 335

Leu Met Thr Glu Lys Thr Arg Thr Ile Leu Phe Tyr Pro Trp Glu Pro
340 345 350

Thr Ser Lys Asn Glu Leu Glu Gln Ser Phe His Val Thr Ser Leu Thr
355 360 365

Asp Ile Tyr
370 



<210> 111 
<211> 114 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals stop translation
<400> 111 

Met Arg Pro Leu Leu Leu Gly Gly Tyr Trp Val Leu Cys Leu Ser Val 
1 5 10 15

Leu Gly His Ala Ala Leu Tyr His Phe Trp Leu Arg Glu Glu Gly Lys
20 25 30

Gly Pro Pro Gln Val Xaa Ser Val Leu Ala Leu Ala Leu Pro Ala Gly
35 40 45

Ser Cys Ala Pro Gly Leu Pro Phe Pro Gly Pro Leu Ile Pro Thr Gln
50 55 60

Leu Leu Phe Ala Leu Glu Trp Gly Thr Pro Thr Pro Leu Arg Asp His
65 70 75 80

Pro Pro His Ser Met His Ser Ala Pro Gln Asn Pro Pro Val Phe Leu
85 90 95

Gly Thr His Thr Cys Pro Pro Ser Trp Tyr Phe Arg Leu Ile Pro Gln
100 105 110 

Ala Xaa 



<210> 112 
<211> 152 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (152) 

<223> Xaa equals stop translation 
<400> 112

Met Arg Arg Leu Leu Leu Val Thr Ser Leu Val Val Val Leu Leu Trp
1 5 10 15

Glu Ala Gly Ala Val Pro Ala Pro Lys Val Pro Ile Lys Met Gln Val
20 25 30

Lys His Trp Pro Ser Glu Gln Asp Pro Glu Lys Ala Trp Gly Ala Arg
35 40 45

Val Val Glu Pro Pro Glu Lys Asp Asp Gln Leu Val Val Leu Phe Pro
50 55 60

Val Gln Lys Pro Lys Leu Leu Thr Thr Glu Glu Lys Pro Arg Gly Gln
65 70 75 80

Gly Arg Gly Pro Ile Leu Pro Gly Thr Lys Ala Trp Met Glu Thr Glu
85 90 95

Asp Thr Leu Gly Arg Val Leu Ser Pro Glu Pro Asp His Asp Ser Leu
100 105 110

Tyr His Pro Pro Pro Glu Glu Asp Gln Gly Glu Glu Arg Pro Arg Leu
115 120 125

Trp Val Met Pro Asn His Gln Val Leu Leu Gly Pro Glu Glu Asp Gln
130 135 140

Asp His Ile Tyr His Pro Gln Xaa
145 150 



<210> 113 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals stop translation 
<400> 113 

Met Pro Cys Gly Lys Phe Leu Phe Pro Val Ser Pro Val Ser Ser Leu 
1 5 10 15 

Ser Leu His Trp Ser Ala Val Leu Leu Leu Leu Leu Ala Asp Phe Pro 
20 25 30 

Arg Val His Gly Ser Pro Pro Gly Val Ser Arg Val Ser Ile Leu His
35 40 45 

Cys Leu Phe Pro Phe Leu Ser Xaa 
50 55 
<210> 114
<211> 237 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (237) 

<223> Xaa equals stop translation 
<400> 114 

Met Glu Val Arg Leu Ile Phe Leu Ser Gly Leu Cys Ile Ala Val Ala
1 5 10 15

Val Val Trp Ala Val Phe Arg Asn Glu Asp Arg Trp Ala Trp Ile Leu
20 25 30

Gln Asp Ile Leu Gly Ile Ala Phe Cys Leu Asn Leu Ile Lys Thr Leu
35 40 45

Lys Leu Pro Asn Phe Lys Ser Cys Val Ile Leu Leu Gly Leu Leu Leu
50 55 60

Leu Tyr Asp Val Phe Phe Val Phe Ile Thr Pro Phe Ile Thr Lys Asn
65 70 75 80

Gly Glu Ser Ile Met Val Glu Leu Ala Ala Gly Pro Phe Gly Asn Asn
85 90 95

Glu Lys Leu Pro Val Val Ile Arg Val Pro Lys Leu Ile Tyr Phe Ser
100 105 110

Val Met Ser Val Cys Leu Met Pro Val Ser Ile Leu Gly Phe Gly Asp
115 120 125

Ile Ile Val Pro Gly Leu Leu Ile Ala Tyr Cys Arg Arg Phe Asp Val
130 135 140

Gln Thr Gly Ser Ser Tyr Ile Tyr Tyr Val Ser Ser Thr Val Ala Tyr
145 150 155 160

Ala Ile Gly Met Ile Leu Thr Phe Val Val Leu Val Leu Met Lys Lys
165 170 175

Gly Gln Pro Ala Leu Leu Tyr Leu Val Pro Cys Thr Leu Ile Thr Ala
180 185 190

Ser Val Val Ala Trp Arg Arg Lys Glu Met Lys Lys Phe Trp Lys Gly
195 200 205

Asn Ser Tyr Gln Met Met Asp His Leu Asp Cys Ala Thr Asn Glu Glu
210 215 220

Asn Pro Val Ile Ser Gly Glu Gln Ile Val Gln Gln Xaa
225 230 235 



<210> 115 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 
<400> 115 

Met Phe Cys Phe Tyr Leu His Phe Ile Phe His Val Leu Ser Tyr Lys
1 5 10 15

Leu Asn Pro Leu Leu Phe Phe Ser Cys Ser Cys Phe Cys Phe Ile Leu
20 25 30 

Val Phe Leu Phe Pro Asp Tyr His Leu Gly Met Xaa 
35 40 



<210> 116 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 116 

Met Val Arg His Ile Arg Glu Arg Arg Arg Gln Pro Leu Ala Phe Gln
1 5 10 15

Arg Val Leu Leu Ser Leu Cys Leu Leu Glu Gly Ile Trp His Ser Pro
20 25 30

Ala Ala Ala Ala Gly Gly Gly Ser His Cys Ser Ser Trp Pro Ser Leu
35 40 45

Tyr Thr Thr Phe Gln Arg Val Ser Leu Leu Glu Leu Asp Leu Gly Leu
50 55 60 

Xaa 
65 



<210> 117 
<211> 118 
<212> PRT 
<213> Homo sapiens
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 117 

Met Ala Arg Ser Ala Leu Arg Leu Glu Ile Leu Gly Gln Leu Leu Val
1 5 10 15

Gly Val Ser Ser Cys Cys Ala Glu Ile Arg Ser Arg Ser Tyr Leu Gly
20 25 30

Phe Cys Trp Lys Asn Ile Gln Asp Glu Arg Lys Lys Lys Ile Ile Leu
35 40 45

Arg Gly Ser Arg Asn Leu Leu Cys Pro Arg Leu Leu Arg Pro Leu Glu
50 55 60

Pro Val Gln Ala Lys Gly Thr Gln Ser Val Asp Pro Arg Glu Val Val
65 70 75 80

Arg Glu Thr Arg Ser Met Ser Thr Leu Pro Ala Asp Phe Cys Leu Leu
85 90 95

Pro Gln Ala Ser Arg Met Ala Gln Lys Gly Ser Pro Ser Arg Ser Ser
100 105 110

Leu Gln Leu Leu Phe Xaa
115 



<210> 118 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 118 

Met Thr Val Ser Leu Phe Leu Leu Leu Ala Thr Ser Gln Ser Gln Asp
1 5 10 15

Gly Cys Cys Asp Ser Gly Ser Cys Pro Asn Ser Arg Gln Gln Glu Gly
20 25 30

His Gly Ala Ala Pro Ala Ser Arg Cys Pro Cys Arg Pro Ser Leu Gln
35 40 45

Ala Gln Glu Pro Lys Glu Glu Ser Thr Gln Met Trp Cys Ser His Leu
50 55 60

Xaa
65 



<210> 119 

<211> 43 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 



<400> 119 

Met Leu Lys Trp Thr Gly Phe Leu Val Val Leu Val Ala Phe Lys Lys 
1 5 10 15

Ile Ser Ala Ser Phe Gln Val Asn Tyr Asn Leu Lys Phe Glu Ile Ser
20 25 30

Phe Gly Glu Pro Trp Lys Phe Thr Gln Trp Xaa
35 40 



<210> 120 

<211> 48 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 



<400> 120 

Met Ser Phe Gly Ile Ser Ile His Thr Cys Thr Tyr Leu Ile Phe Ile
1 5 10 15

Ala Phe His Phe Ile Ala Leu Cys Lys Val Thr Phe Phe Thr Asp Ser
20 25 30

Arg Phe Gly Asn Pro Met Ser Ile Ser Leu Ser Ala Pro Phe Phe Xaa
35 40 45



<210> 121 
<211> 140 
<212> PRT 
<213> Homo sapiens 
<220>
<221> SITE 
<222> (140) 

<223> Xaa equals stop translation 
<400> 121 

Met Ala Leu Gly Ile Gln Lys Arg Phe Ser Pro Glu Val Leu Gly Leu
1 5 10 15

Cys Ala Ser Thr Ala Leu Val Trp Val Val Met Glu Val Leu Ala Leu
20 25 30

Leu Leu Gly Leu Tyr Leu Ala Thr Val Arg Ser Asp Leu Ser Thr Phe
35 40 45

His Leu Leu Ala Tyr Ser Gly Tyr Lys Tyr Val Gly Met Ile Leu Ser
50 55 60

Val Leu Thr Gly Leu Leu Phe Gly Ser Asp Gly Tyr Tyr Val Ala Leu
65 70 75 80

Ala Trp Thr Ser Ser Ala Leu Met Tyr Phe Ile Val Arg Ser Leu Arg
85 90 95

Thr Ala Ala Leu Gly Pro Asp Ser Met Gly Gly Pro Val Pro Arg Gln
100 105 110

Arg Leu Gln Leu Tyr Leu Thr Leu Gly Ala Ala Ala Phe Gln Pro Leu
115 120 125

Ile Ile Tyr Trp Leu Thr Phe His Leu Val Arg Xaa
130 135 140 



<210> 122 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 
<400> 122 

Met Met Asp Phe Leu Arg Cys Val Thr Ala Ala Leu Ile Tyr Phe Ala
1 5 10 15

Ile Ser Ile Thr Ala Ile Ala Lys Tyr Ser Asp Gly Ala Ser Lys Ala
20 25 30

Ala Gly Gly Ser Val Pro Asp Thr Arg Ala Val Cys Pro Ser Arg Ser
35 40 45

Glu Met Gly Arg Glu Leu Gly Ala Ala Ala Ser Arg Glu Gln Gly Val
50 55 60

Ser Pro Val Met His Pro Ile His Pro Val His Arg Cys Leu Ala Ser
65 70 75 80

Leu Leu Pro Ser Cys Leu Gln Leu Xaa Ser Thr Xaa
85 90



<210> 123 
<211> 347 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (242) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (246) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (347) 

<223> Xaa equals stop translation 
<400> 123 

Met Arg Arg Gly Ala Gly Ala Ala Arg Gly Arg Ala Ser Trp Cys Trp 
1 5 10 15 

Ala Leu Ala Leu Leu Trp Leu Ala Val Val Pro Gly Trp Ser Arg Val 
20 25 30 

Ser Gly Ile Pro Ser Arg Arg His Trp Pro Val Pro Tyr Lys Arg Phe 
35 40 45 

Asp Phe Arg Pro Lys Pro Asp Pro Tyr Cys Gln Ala Lys Tyr Thr Phe 
50 55 60 

Cys Pro Thr Gly Ser Pro Ile Pro Val Met Glu Gly Asp Asp Asp Ile 
65 70 75 80 

Glu Val Phe Arg Leu Gln Ala Pro Val Trp Glu Phe Lys Tyr Gly Asp 
85 90 95 

Leu Leu Gly His Leu Lys Ile Met His Asp Ala Ile Gly Phe Arg Ser 
100 105 110 

Thr Leu Thr Gly Lys Asn Tyr Thr Met Glu Trp Tyr Glu Leu Phe Gln 
115 120 125 

Leu Gly Asn Cys Thr Phe Pro His Leu Arg Pro Glu Met Asp Ala Pro 
130 135 140 

Phe Trp Cys Asn Gln Gly Ala Ala Cys Phe Phe Glu Gly Ile Asp Asp 
145 150 155 160 

Val His Trp Lys Glu Asn Gly Thr Leu Val Gln Val Ala Thr Ile Ser 
165 170 175 

Gly Asn Met Phe Asn Gln Met Ala Lys Trp Val Lys Gln Asp Asn Glu 
180 185 190 

Thr Gly Ile Tyr Tyr Glu Thr Trp Asn Val Lys Ala Ser Pro Glu Lys 
195 200 205 

Gly Ala Glu Thr Trp Phe Asp Ser Tyr Asp Cys Ser Lys Phe Val Leu 
210 215 220 

Arg Thr Phe Asn Lys Leu Ala Glu Phe Gly Ala Glu Phe Lys Asn Ile 
225 230 235 240 

Glu Xaa Asn Tyr Thr Xaa Ile Phe Leu Tyr Ser Gly Glu Pro Thr Tyr 
245 250 255 

Leu Gly Asn Glu Thr Ser Val Phe Gly Pro Thr Gly Asn Lys Thr Leu 
260 265 270 

Gly Leu Ala Ile Lys Arg Phe Tyr Tyr Pro Phe Lys Pro His Leu Pro 
275 280 285 

Thr Lys Glu Phe Leu Leu Ser Leu Leu Gln Ile Phe Asp Ala Val Ile 
290 295 300 

Val His Lys Gln Phe Tyr Leu Phe Tyr Asn Phe Glu Tyr Trp Phe Leu 
305 310 315 320 

Pro Met Lys Phe Pro Phe Ile Lys Ile Thr Tyr Glu Glu Ile Pro Leu 
325 330 335 

Pro Ile Arg Asn Lys Thr Leu Ser Gly Leu Xaa 
340 345 



<210> 124 
<211> 234 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (173) 
<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals stop translation 
<400> 124 

Met His Arg Gly Lys Leu Asp Cys Ala Gly Gly Ala Leu Leu Ser Ser 
1 5 10 15 

Tyr Leu Ile Val Leu Met Ile Leu Leu Ala Val Val Ile Cys Thr Val 
20 25 30 

Ser Ala Ile Met Cys Val Ser Met Arg Gly Thr Ile Cys Asn Pro Gly 
35 40 45 

Pro Arg Lys Ser Met Ser Lys Leu Leu Tyr Ile Arg Leu Ala Leu Phe 
50 55 60 

Phe Pro Glu Met Val Trp Ala Ser Leu Gly Ala Ala Trp Val Ala Asp 
65 70 75 80 

Gly Val Gln Cys Asp Arg Thr Val Val Asn Gly Ile Ile Ala Thr Val 
85 90 95 

Val Val Ser Trp Ile Ile Ile Ala Ala Thr Val Val Ser Ile Ile Ile 
100 105 110 

Val Phe Asp Pro Leu Gly Gly Lys Met Ala Pro Tyr Ser Ser Ala Gly 
115 120 125 

Pro Ser His Leu Asp Ser His Asp Ser Ser Gln Leu Leu Asn Gly Leu 
130 135 140 

Lys Thr Ala Ala Thr Ser Val Trp Glu Thr Arg Ile Lys Leu Leu Cys 
145 150 155 160 

Cys Cys Ile Gly Lys Asp Asp His Thr Arg Val Ala Xaa Ser Ser Thr 
165 170 175 

Ala Glu Leu Phe Ser Thr Tyr Phe Ser Asp Thr Asp Leu Val Pro Ser 
180 185 190 

Asp Ile Ala Ala Gly Leu Ala Leu Leu His Gln Gln Gln Asp Asn Ile 
195 200 205 

Arg Asn Asn Gln Asp Leu Pro Arg Trp Ser Ala Met Pro Gln Gly Ala 
210 215 220 

Pro Arg Lys Leu Ile Trp Met Gln Asn Xaa 
225 230 



<210> 125 
<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals stop translation 
<400> 125 

Met Gln Gly Val Leu Phe Gly Phe Val Trp Leu Phe Ser Phe Leu Trp 
1 5 10 15 

Gln Glu Asn Lys Ser Ser Ala Ser Pro Ser Thr Leu Ala Lys Ser Gly 
20 25 30 

Ser Pro Cys Pro Val Ser Ile Pro Trp Met Pro Gly Val Leu Val Arg 
35 40 45 

Phe Phe Thr Leu Leu Xaa 
50 



<210> 126 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (82) 

<223> Xaa equals stop translation 
<400> 126 

Met Arg Met Arg Val Ala Val Ala Pro Arg Pro His Gln His Leu Val 
1 5 10 15 

Val Ser Val Ser Trp Ile Leu Ala Ile Leu Ile Ser Val Ser Gly Tyr 
20 25 30 

His Cys Phe His Leu Gln Phe Ser Tyr Met Val Xaa Asn Ile Phe Pro 
35 40 45 

His Val Tyr Leu Ser Ser Ala Tyr Leu Leu Arg Pro Val Ile Cys Ser 
50 55 60 

Asp Leu Leu Pro Val Phe Val Cys Leu His Val Cys Leu Cys Leu Ile 
65 70 75 80 



Phe Xaa 
<210> 127 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 127 

Met Gly Trp Glu Ala Ala Leu Ala Leu Leu Val Ser Ala Val Phe Phe 
1 5 10 15 

Pro Trp Cys Thr Ile Gln Arg Pro Asp Val Gly Thr Thr Ser Pro Gly 
20 25 30 

Gly Leu Glu Arg Arg Ser Lys Gly Phe Xaa 
35 40 



<210> 128 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 128 

Met Thr Phe Met Ile Leu Lys Phe Phe Phe Leu Cys Gly Phe Val Leu 
1 5 10 15 

Asn Arg Leu Ile Ala Arg Gln Leu Ala Lys Ile His Ala Ile His Ala 
20 25 30 

His Asn Gly Trp Ile Pro Lys Ser Asn Leu Trp Leu Lys Met Gly Lys 
35 40 45 

Tyr Phe Ser Leu Ile Pro Thr Gly Phe Ala Asp Glu Asp Ile Asn Lys 
50 55 60 

Arg Xaa 
65 



<210> 129 
<211> 50 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 129 

Met Ile Val Asn His Phe Ser Phe Leu Phe Cys Trp Ile Val Phe Cys 
1 5 10 15 

Phe Leu Leu Gln His Ser Cys Phe Cys Cys Ala Tyr Phe Trp Ser Phe 
20 25 30 

Asp Ser Leu Cys His Cys Phe Leu Ser His Thr Pro Leu Arg Phe Thr 
35 40 45 

Gln Xaa 
50 



<210> 130 
<211> 227 
<212> PRT 
<213> Homo sapiens 



<220> 
<221> SITE 
<222> (227) 

<223> Xaa equals stop translation 



<400> 130 

Met Glu Thr Val Val Ile Val Ala Ile Gly Val Leu Ala Thr Ile Phe 
1 5 10 15 

Leu Ala Ser Phe Ala Ala Leu Val Leu Val Cys Arg Gln Arg Tyr Cys 
20 25 30 

Arg Pro Arg Asp Leu Leu Gln Arg Tyr Asp Ser Lys Pro Ile Val Asp 
35 40 45 

Leu Ile Gly Ala Met Glu Thr Gln Ser Glu Pro Ser Glu Leu Glu Leu 
50 55 60 

Asp Asp Val Val Ile Thr Asn Pro His Ile Glu Ala Ile Leu Glu Asn 
65 70 75 80 

Glu Asp Trp Ile Glu Asp Ala Ser Gly Leu Met Ser His Cys Ile Ala 
85 90 95 

Ile Leu Lys Ile Cys His Thr Leu Thr Glu Lys Leu Val Ala Met Thr 
100 105 110 

Met Gly Ser Gly Ala Lys Met Lys Thr Ser Ala Ser Val Ser Asp Ile 
115 120 125 

Ile Val Val Ala Lys Arg Ile Ser Pro Arg Val Asp Asp Val Val Lys 
130 135 140 

Ser Met Tyr Pro Pro Leu Asp Pro Lys Leu Leu Asp Ala Arg Thr Thr 
145 150 155 160 

Ala Leu Leu Leu Ser Val Ser His Leu Val Leu Val Thr Arg Asn Ala 
165 170 175 

Cys His Leu Thr Gly Gly Leu Asp Trp Ile Asp Gln Ser Leu Ser Ala 
180 185 190 

Ala Glu Glu His Leu Glu Val Leu Arg Glu Ala Ala Leu Ala Ser Glu 
195 200 205 

Pro Asp Lys Gly Leu Pro Gly Pro Glu Gly Phe Leu Gln Glu Gln Ser 
210 215 220 

Ala Ile Xaa 
225 



<210> 131 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 131 

Met Gln Arg Ile Ala Ser Leu Leu Thr Leu Leu Thr Gln Leu Thr Leu 
1 5 10 15 

Ala Ala Gly Ser Thr Pro Ala Glu Thr Ile Ser Asp Ser Ala Glu Ala 
20 25 30 

Ser Leu Ser Ala Thr Pro Ser Leu Val Thr Trp Thr Gln Val Ser Gly 
35 40 45 

Leu Gln Pro Leu Val Glu Pro Cys Leu Arg Gln Thr Leu Lys Leu Leu 
50 55 60 

Ser Arg Pro Glu Met Trp Arg Ala Val Gly Pro Val Pro Val Ala Cys 
65 70 75 80 

Leu Leu Phe Leu Gly Ala Tyr Tyr Gln Ala Trp Ser Gln Gln Pro Ser 
85 90 95 

Ser Cys Pro Glu Asp Trp Leu Gln Asp Met Glu Arg Leu Ser Glu Ser 
100 105 110 

Cys Cys Cys His Cys Xaa 

<210> 132 
<211> 306 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (180) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (197) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<220> 

<221> SITE 
<222> (306) 

<223> Xaa equals stop translation 



<400> 132 

Met Ser Glu Asp Arg Pro Met Leu Gln Phe Leu Leu His Thr Ser Phe 
1 5 10 15 

Leu Ser Pro Leu Phe Ile Leu Trp Leu Trp Thr Lys Pro Ile Ala Arg 
20 25 30 

Asp Phe Leu His Gln Pro Pro Phe Gly Glu Thr Arg Phe Ser Leu Leu 
35 40 45 

Ser Asp Ser Ala Phe Asp Ser Gly Arg Leu Trp Leu Leu Val Val Leu 
50 55 60 

Cys Leu Leu Arg Leu Ala Val Thr Arg Pro His Leu Gln Ala Tyr Leu 
65 70 75 80 

Cys Leu Ala Lys Ala Arg Val Glu Gln Leu Arg Arg Glu Ala Gly Arg 
85 90 95 

Ile Glu Ala Arg Glu Ile Gln Gln Arg Val Val Arg Val Tyr Cys Tyr 
100 105 110 

Val Thr Val Val Ser Leu Gln Tyr Leu Thr Pro Leu Ile Leu Thr Leu 
115 120 125 

Asn Cys Thr Leu Leu Leu Lys Thr Leu Gly Gly Tyr Ser Trp Gly Leu 
130 135 140 

Gly Pro Ala Pro Leu Leu Ser Pro Arg Pro Ile Leu Ser Gln Arg Cys 
145 150 155 160 

Pro His Arg Leu Trp Gly Gly Arg Ser Pro Ala Asp Cys Ser Ala Asp 
165 170 175 

Cys Arg Gly Xaa Gly Trp Pro Ala Tyr Ser Pro Leu Pro Pro Trp Arg 
180 185 190 

Pro Gly Leu Pro Xaa Leu Val Asp Gly Cys Leu Pro Ala Ala Arg Gln 
195 200 205 

Pro Phe Arg Pro Leu Leu Pro Pro Ala Leu Gly Arg Leu Leu Ala Ala 
210 215 220 

Cys Arg Pro Ser Trp Gly Pro Glu Val Cys Ser Trp Gly Ser Gly Thr 
225 230 235 240 

Leu Ala Cys Pro Leu Cys Leu Arg Pro Arg Val Pro Ser Cys Lys Val 
245 250 255 

Gly Pro Asp Ser Pro Ala Phe Pro Ser Pro Gln Cys Leu Thr Arg Gly 
260 265 270 

Pro Pro Trp Thr Pro Ser Phe Cys Leu Arg Thr Val Ser Pro Gly Pro 
275 280 285 

Ser Ser Met Arg Val Pro Arg Pro Leu Ser Pro Lys Arg Met Cys Gln 
290 295 300 

Val Xaa 
305 



<210> 133 

<211> 45 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 



<400> 133 

Met Ser Tyr Ser Leu Phe Leu Ala Leu Leu Ser Phe Ala Ser Ala Ile 
1 5 10 15 

Leu Phe Val Ala Gly Thr Ile Ala Gly Thr Gly Gly Leu Ser Phe His 
20 25 30 

Gly Ile Ala Thr Ile Phe Val Leu Thr Gly Lys Trp Xaa 
35 40 45 



<210> 134 

<211> 44 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 
<400> 134 

Met Gly Arg Leu Gly Xaa Gln Cys Leu Leu Phe Leu Ala Phe Lys Ala 
1 5 10 15 

Ile Ser Gly Val Phe Phe Leu Phe Trp Arg Pro Ala Asp Ser Thr Glu 
20 25 30 

Arg Asn Thr Gln Ser Trp Asp Phe Pro Pro Leu Xaa 
35 40 



<210> 135 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 135 

Met Gly Val Gly Val Leu Arg Ile Leu Leu Ser Cys Leu Gly Glu Ala 
1 5 10 15 

Ala Pro Lys Ser Ala Gly Thr Ser Leu Glu Ser Ala Lys Glu Cys Trp 
20 25 30 

Ser Ala Ala Thr Leu Leu Val Leu Cys Val Leu Cys Gln Leu Gln His 
35 40 45 

Gly Xaa 
50 



<210> 136 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (81) 

<223> Xaa equals stop translation 
<400> 136 

Met Glu Ser Leu Pro Glu Asn Lys Pro Leu Val Trp Ser Leu Ala Val 
1 5 10 15 

Ser Leu Leu Ala Ile Ile Gly Leu Leu Leu Gly Ser Ser Pro Asp Phe 
20 25 30 

Asn Ser Gln Phe Gly Leu Val Asp Ile Pro Val Glu Phe Lys Leu Val 
35 40 45 

Ile Ala Gln Val Leu Leu Leu Asp Phe Cys Leu Ala Leu Leu Ala Asp 
50 55 60 

Arg Val Leu Gln Phe Phe Leu Gly Thr Pro Lys Leu Lys Val Pro Ser 
65 70 75 80 

Xaa 



<210> 137 
<211> 277 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (103) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (277) 

<223> Xaa equals stop translation 
<400> 137 

Met Ile His Val Asn Arg Asn Ile Met Asp Phe Lys Leu Phe Leu Val 
1 5 10 15 

Phe Val Ala Gly Val Phe Leu Phe Phe Tyr Ala Arg Thr Leu Glu Ser 
20 25 30 

Lys Pro Tyr Phe Leu Leu Leu Leu Gly Asn Cys Ala Arg Cys Ser Asn 
35 40 45 

Asp Ile Val Phe Val Leu Leu Leu Val Lys Arg Phe Ile Arg Ser Ile 
50 55 60 

Ala Pro Phe Gly Ala Leu Met Val Gly Cys Trp Phe Ala Ser Val Tyr 
65 70 75 80 

Ile Val Cys Gln Leu Met Glu Asp Leu Lys Trp Leu Trp Xaa Glu Asn 
85 90 95 

Arg Ile Tyr Val Ser Gly Xaa Val Leu Ile Val Gly Phe Phe Ser Phe 
100 105 110 

Val Val Cys Tyr Lys His Gly Pro Leu Ala His Asp Arg Ser Arg Ser 
115 120 125 

Leu Leu Met Trp Met Leu Arg Leu Leu Ser Leu Val Leu Val Tyr Ala 
130 135 140 

Gly Val Ala Val Pro Gln Phe Ala Tyr Ala Ala Ile Ile Leu Leu Met 
145 150 155 160 

Ser Ser Trp Ser Leu His Tyr Pro Leu Arg Ala Cys Ser Tyr Met Arg 
165 170 175 

Trp Lys Met Glu Gln Trp Phe Thr Ser Lys Glu Leu Val Val Lys Tyr 
180 185 190 

Leu Thr Glu Asp Glu Tyr Arg Glu Gln Ala Asp Ala Glu Thr Asn Ser 
195 200 205 

Ala Leu Glu Glu Leu Arg Arg Ala Cys Arg Lys Pro Asp Phe Pro Ser 
210 215 220 

Trp Leu Val Val Ser Arg Leu His Thr Pro Ser Lys Phe Ala Asp Phe 
225 230 235 240 

Val Leu Gly Gly Ser His Leu Ser Pro Glu Glu Ile Ser Leu His Glu 
245 250 255 

Glu Gln Tyr Gly Leu Gly Gly Ala Phe Leu Glu Glu Gln Leu Phe Asn 
260 265 270 

Pro Ser Thr Ala Xaa 
275 



<210> 138 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals stop translation 
<400> 138 

Met Cys Gln Thr Leu Pro Ala Arg Leu Arg Ala Gln Cys Ile Ser Ser 
1 5 10 15 

Leu Leu Phe Leu Leu Met Gly Leu Leu Ala Met Thr Gly Glu Arg Asn 
20 25 30 

Gln Gly Thr His Tyr Tyr Glu Phe Ser Gly Phe Ile Phe Lys Ser Gln 
35 40 45 

Met Met Trp Ser Ile Lys Pro Asn Xaa 
50 55 



<210> 139 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

<223> Xaa equals stop translation 
<400> 139 

Met Tyr Leu Trp Phe Ser Phe Ser Thr Val Gly Leu Cys Gly Cys Cys 
1 5 10 15 

Leu Leu Tyr Arg Ala Cys Gly Phe Ile Trp Tyr Leu Leu Leu Leu Gly 
20 25 30 

His Ser Ser Thr Asn Ser Leu Gln Asp Gly Gly Ala Glu Arg Pro Glu 
35 40 45 

His Pro Trp Ala His Val Arg Tyr Ser Cys Arg Arg Glu Leu Ser Phe 
50 55 60 

Trp Phe Tyr Val Phe Asn Xaa 
65 70 



<210> 140 

<211> 63 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (63) 

<223> Xaa equals stop translation 
<400> 140 

Met Glu Pro Glu Ser Trp Ala Leu Cys Leu Leu Leu Phe Leu Gly Thr 
1 5 10 15 

Ala Leu Gly Tyr Pro Pro Leu Pro Arg His Ser Ser Lys Cys Glu Ile 
20 25 30 

Leu Glu Val Arg Leu His Leu Leu Pro Leu Leu Ile Asn Ile Gly Met 
35 40 45 

Met Ser Pro Val Ala Ser Pro Phe Val Cys Ser Ile Thr Gly Xaa 
50 55 60 



<210> 141 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals stop translation 
<400> 141 

Met Leu Phe Leu Ser Ala Ser Ile Cys Thr Ser Ala Leu Phe Leu Cys 
1 5 10 15 

Leu Ser Arg Leu Thr Ile Ser Ala Pro His Pro Ala Trp Trp Gly Arg 
20 25 30 

Met Pro Thr His Thr Ser Pro Gly His Leu Leu Glu Leu Gln Pro Arg 
35 40 45 

Gly Met Thr Glu Ser Ile Leu Phe Ser Ile Ser Ala Leu Val Ser Asn 
50 55 60 

Ser Trp Gly Lys Met Thr Gln Leu Thr Ser Gly Ser His Ser Trp Ser 
65 70 75 80 

Ser Gly Leu Gln Asn Phe Gln Ala Xaa 
85 



<210> 142 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 142 

Met Arg Pro Val Cys Ser Leu Gly Trp Ala Gly Trp Pro Gly Leu Val 
1 5 10 15 

Cys Gly Leu Arg Ala Leu Leu Gly Pro Ser Leu Phe Pro Val Thr Phe 
20 25 30 

Gly Ala Thr Glu Ala Val His Ser Leu Asp Val Cys Ser Xaa 
35 40 45 



<210> 143 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals stop translation 
<400> 143 

Met Val Asn Glu Lys Glu Ala Arg Thr Gly Ser Pro Lys Ser Trp Leu 
1 5 10 15 

Leu Cys Leu Ala Leu Leu Leu Ile Lys Tyr Val Thr Phe Cys Lys Pro 
20 25 30 

Tyr Leu Thr Lys Pro Tyr Phe Leu His Leu Ser Val Leu Asp Gln Leu 
35 40 45 

Ser Pro Gly Thr Pro Leu Asp Xaa 
50 55 



<210> 144 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals stop translation 
<400> 144 

Met Phe Ile Ala Ile Tyr Phe Lys Ala Phe His Gly Ser Phe Gln Leu 
1 5 10 15 

Cys Thr Trp Leu Val Ile Met Ile Val Ile Leu Gly Gln Ser Phe Ser 
20 25 30 

Ala Leu Ala Leu Leu Thr Phe Trp Leu Ile Leu Cys Cys Arg Gly Cys 
35 40 45 

Pro Val His Cys Arg Val Phe Ser Ser Ile Pro Asp Leu Tyr Leu Leu 
50 55 60 

Asn Ala Arg Ser Asn Thr Val Pro Pro Ala Gln Leu Xaa 
65 70 75 



<210> 145 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 145 

Met Phe Phe Leu Ser Met Phe Leu His Ile Val Leu Leu His Cys Gly 
1 5 10 15 

Asn Ser Phe Tyr Lys Ile Cys His Ser Trp Asp Tyr Ala Ala Leu Gln 
20 25 30 

Glu Ser Thr Arg Phe Tyr Ser Asn Ser Tyr Xaa 
35 40 



<210> 146 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (102) 

<223> Xaa equals stop translation 
<400> 146 

Met Glu Leu Glu Arg Cys Ser Val Val Leu Cys Ile Leu Ala Asn Leu 
1 5 10 15 

Ala Val Leu Arg Ala Leu Phe Leu Pro Cys Ile Ile Phe His Cys Val 
20 25 30 

Ser Asp Ser Arg Ser Val Asn Arg Glu Thr Lys Val Lys Phe Val His 
35 40 45 

Thr Ser Val His Gly Val Gly His Ser Phe Val Gln Ser Ala Phe Lys 
50 55 60 

Ala Phe Xaa Leu Val Pro Pro Glu Ala Val Pro Glu Gln Lys Asp Pro 
65 70 75 80 

Asp Pro Glu Phe Pro Thr Val Lys Tyr Pro Asn Pro Glu Glu Gly Lys 
85 90 95 



Gly Val Leu Val Thr Xaa 
100 



<210> 147 
<211> 134 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (134) 

<223> Xaa equals stop translation 
<400> 147 

Met Arg Val Pro Leu Val Leu Ser Trp Ala Phe Val Leu Val Gly Phe 
1 5 10 15 

Ser Gly Val Tyr Leu Ala Ser Glu Ser Phe Trp Phe Pro Pro Ser Leu 
20 25 30 

Cys Asp Leu Thr Ser Pro Pro Gly Leu His Leu Trp Lys Phe Ile Arg 
35 40 45 

Asp Leu Val Ser Met Glu Glu Leu Thr Asp Ser Ala Arg Glu Met Gly 
50 55 60 

Tyr Trp Met Met Val Phe Ser Leu Lys Ala Met Phe Pro Val Ser Ser 
65 70 75 80 

Gly Cys Phe Gln Glu Arg Gln Glu Thr Asn Lys Ser Leu Thr Leu Leu 
85 90 95 

Arg Cys Ser Gln Arg Asp Thr Ser Pro Leu Met Asp Gly Gln Thr Trp 
100 105 110 

Ala Arg Val Arg Val Thr Lys Pro Pro Thr Thr Ala Thr Ala Ala Tyr 
115 120 125 

Asn Arg His Ile Arg Xaa 
130 



<210> 148 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (50) 
<223> Xaa equals stop translation 

<400> 148 

Met Lys Ser Leu Phe Cys Ile Tyr Phe Leu Arg Trp Pro Met Gly Leu 
1 5 10 15 

Ser Trp Gly Glu Thr Phe Ile Leu Leu Arg Asp Ser Leu Ala Ile Asn 
20 25 30 

Phe Gln Ser Phe Ser Lys Ala Ala Ser Gly Asp Ile Phe Gly Cys His 
35 40 45 

Asp Xaa 
50 



<210> 149 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 
<400> 149 

Met Ser Cys Gly Leu Xaa Phe Gly Pro Trp Phe Val Pro Met Leu Leu 
1 5 10 15 

Met Ser His Ser Leu Leu Pro Ser Trp Ser Gly Leu Trp Val Thr Thr 
20 25 30 

Trp Asn Gly Ser Ser Gly Glu Arg Thr Pro Ser Pro Trp Arg Arg Lys 
35 40 45 

Arg Ala Ser Gln Ser Ala Gly Arg Ile Ala Ser Trp Met Ser Phe Xaa 
50 55 60 



<210> 150 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (59) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (75) 

<223> Xaa equals stop translation 
<400> 150 

Met Leu Ser Ser Pro Asn Leu Ala Ala Ser Leu Leu Cys Leu Trp His 
1 5 10 15 

Ser Gly Ser Ala Thr Asn Trp Ala Pro Pro Cys Ala Gly Met Trp Ala 
20 25 30 

Ser Arg Cys Gly Trp Lys Val Ser Pro His Pro Glu Ala Gly Pro Cys 
35 40 45 

Ser Ser Ala Leu Trp Val Ser Cys Cys Val Xaa Ala Glu Gln Pro Gln 
50 55 60 

Pro Gly Gly Arg Glu Pro Arg His Arg Gly Xaa 
65 70 75 



<210> 151 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 
<400> 151 

Met Pro His Ile Ser Phe Cys Leu Gly Thr Pro Tyr Val Val Ala Val 
1 5 10 15 

Tyr Leu Pro Ala Trp Ile Val Met Leu Leu Leu Pro Gly Val Arg Pro 
20 25 30 

Tyr Ser Ser Leu Gln Ala Leu Lys His Pro Ser Cys Ser Ser Ser Ser 
35 40 45 

Val Cys Ala Pro Tyr Met Xaa 
50 55 



<210> 152 

<211> 58 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 152 

Met Gly Leu Asn Ile Ser Pro Trp Cys Phe Leu Ala Ile Leu Thr Cys 
1 5 10 15 

Ala Ile Ser Ala Ala Phe Ile Ser Val Gly Val Val Cys Trp Leu Leu 
20 25 30 

Phe Leu Ile Ser His Arg Ser Ser Lys Asn Leu Arg Lys Ser Arg Val 
35 40 45 

Arg Gly Val Trp Glu Asn Glu Glu Ile Xaa 
50 55 



<210> 153 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 153 

Met Ala Tyr Val Leu Ala Val Leu Cys Phe Lys Ser Leu Trp Ala Leu 
1 5 10 15 

Phe Lys Pro Asn Lys Gln Leu Ile Glu Phe Leu Leu Met Val Lys Val 
20 25 30 

Val Lys Ile Pro Leu Cys Tyr Leu Arg Gln Leu Leu Gly Gly Ile Lys 
35 40 45 

Thr Pro Arg Val Xaa 
50 



<210> 154 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 154 

Met Asp Gly Gly Pro Gly Ala Phe Ser Arg Ala Trp Val Leu Gln Ile 
1 5 10 15 

Pro Trp Leu Leu Leu Ser Gly Gly Asn Phe Ala Leu Cys Glu Pro Arg 
20 25 30 

Pro Cys Pro Ser Ala Gly His Pro Trp Gln Glu Ala Gly Leu Pro Ser 
35 40 45 

Ser Pro Xaa 
50 



<210> 155 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals stop translation 
<400> 155 

Met Pro Phe Leu Ser Val Trp Phe Phe Asn Leu Gly Leu Ile Phe Gly 
1 5 10 15 

Val Glu Ser Phe Val Leu Arg Ala Val Leu Phe Ile Ala Gly Cys Ser 
20 25 30 

Ala Thr Ser Gln Met Glu Ala Ala Ser Pro Tyr Pro Ala Val Thr Lys 
35 40 45 

Arg Lys Lys Asn Val Ser Xaa His Cys Gln Ile Ser Ser Gly Gly Ala 
50 55 60 

Pro Gly Xaa 
65 



<210> 156 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 
<400> 156 

Met Leu Leu Lys Arg Asn Leu Leu Ile Leu Ile Leu Phe Leu Val Thr 
1 5 10 15 

Cys Phe Asn Phe Val Ser Phe Phe Phe Phe Pro Trp Lys Leu Leu Gly 
20 25 30 

Ser Pro Phe Tyr Pro Cys Ser Leu Arg Ser Asp Asn Asp Gly Cys Val 
35 40 45 



Xaa 



<210> 157 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<400> 157 

Met Gly Ser Phe Leu His Pro Gln Trp His Leu Leu Ile Thr Phe Cys 
1 5 10 15 

Ala Val Leu Gly Lys Gly Leu His Ser Asp Pro Ser Arg Pro Phe Glu 
20 25 30 

His Gly Gly Ala Leu Gly Lys Val Pro Arg Gly Arg Ser Thr Leu Leu 
35 40 45 

Ser Lys Glu Val Leu Leu Lys Lys Lys Lys Lys Lys Arg 
50 55 60 



<210> 158 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (113) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 158 

Met Leu Leu Trp Trp Gln Cys Leu Cys Cys His Ala Val Leu Glu Pro 
1 5 10 15 

Ala Ala Thr Ala Met Pro Glu Asp Ala Ala Pro Ser Ser Leu Pro Val 
20 25 30 

Pro Pro Asn Met Thr Ser Ser Arg Phe His Tyr Phe Trp Thr Leu Leu 
35 40 45 

Gln Ile Lys Leu Thr Gln Phe Tyr Ser Lys Pro Arg Ser Leu Ser Ala 
50 55 60 

Thr Pro Glu Lys Asn Ile Gly Leu Gln Glu Pro Glu Arg Arg Glu Arg 
65 70 75 80 

Phe Thr Gly Glu Ser Cys Arg Trp Glu Leu Lys Ala Lys Ser Cys Leu 
85 90 95 

Cys Pro Thr Arg Asn Ser Leu Gly Cys Thr Gln Cys His Cys Asp Gly 
100 105 110 

Xaa Lys Ile Cys Asn Xaa 
115 



<210> 159 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (151) 

<223> Xaa equals stop translation 
<400> 159 

Met Leu Ala Val Leu Ala Phe Pro Val Gly Val Phe Val Val Ala Val 
1 5 10 15 

Phe Trp Ile Ile Tyr Ala Tyr Asp Arg Glu Met Ile Tyr Pro Lys Leu 
20 25 30 

Leu Asp Asn Phe Ile Pro Gly Trp Leu Asn His Gly Met His Thr Thr 
35 40 45 

Val Leu Pro Phe Ile Leu Ile Glu Met Arg Thr Ser His His Gln Tyr 
50 55 60 

Pro Ser Arg Ser Ser Gly Leu Thr Ala Ile Cys Thr Phe Ser Val Gly 
65 70 75 80 

Tyr Ile Leu Trp Val Cys Trp Val His His Val Thr Gly Met Trp Val 
85 90 95 

Tyr Pro Phe Leu Glu His Ile Gly Pro Gly Ala Arg Ile Ile Phe Phe 
100 105 110 

Gly Ser Thr Thr Ile Leu Met Asn Phe Leu Tyr Leu Leu Gly Glu Val 
115 120 125 

Leu Asn Asn Tyr Ile Trp Asp Thr Gln Lys Ser Met Glu Glu Glu Lys 
130 135 140 

Glu Lys Pro Lys Leu Glu Xaa 
145 150 



<210> 160 



<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 
<400> 160 

Met Gly Asp Lys Leu Gly Met Ala Arg Ala Pro Ser Val Ala Leu Ala 
1 5 10 15 

Gln Leu Trp Leu Ile Cys Leu Cys Pro Glu Ser Leu Ala Ser Phe Val 
20 25 30 

Gln Ala Val Pro Trp Lys Val Leu Gln Pro Ser Ser Asn Arg Ser Thr 
35 40 45 

Asp Cys Ser Pro His Met Arg Pro Thr Cys Glu Thr Leu Gly Ser Arg 
50 55 60 

Lys Ala Gln Asp Leu Val Leu Asp Thr Met Cys Leu Ser Thr Asp Asp 
65 70 75 80 

Cys Gln Gly Leu Ile Cys Arg Gly His Arg Ser Xaa 
85 90 



<210> 161 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 161 

Met Gln Val Ala Cys Val Met Lys Val Ser Ala Gln Trp Val Cys Phe 
1 5 10 15 

Phe Val Val Phe Ser Pro Leu Cys Ser Ser Val Lys Cys Ala Ser Ser 
20 25 30 

Gly Gln Asn Arg Gly Arg Gly Asp Gln Xaa 
35 40 



<210> 162 

<211> 78 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (78) 

<223> Xaa equals stop translation 
<400> 162 

Met Met Leu Gln Ile Ile His Leu Asn Thr Leu Ile Lys Phe Phe Gln 
1 5 10 15 

Cys Leu Lys Leu Phe Leu His Gly Thr Ala Gly Ser Gly Gln Lys Cys 
20 25 30 

Leu Ala Tyr Lys Phe Ser Gln Phe Pro Ser Ile Ile Pro Ala Ala His 
35 40 45 

Lys Lys Val His His Leu Leu Ser Pro Lys Cys Leu Pro Thr Glu Cys 
50 55 60 

Ser Gln Ala Asp Asn Ser Ser Trp Asp Ser Ala Val Trp Xaa 
65 70 75 



<210> 163 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 
<400> 163 

Met Lys Arg Leu Trp Cys Leu Ser Trp Val Pro Gly Leu Gln Gly Ser 
1 5 10 15 

Pro Ser Val Leu Ser Ser Val Phe Phe Ser Val Phe Lys Pro Gln Leu 
20 25 30 

His Trp Thr Cys Ser Gln Val Ser Ser His Trp His Pro Pro Cys Leu 
35 40 45 

Phe Ile Leu Phe Ser Gly Xaa 
50 55 



<210> 164 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (90) 

<223> Xaa equals stop translation 
<400> 164 

Met Lys Phe Leu Leu Ala Ala Leu Val Leu Ser Leu Ile Leu Pro Arg 
1 5 10 15 

Ser Ser Gln Tyr Ile Lys Trp Ile Val Ser Ala Gly Leu Ala Gln Val 
20 25 30 

Ser Glu Phe Ser Phe Val Leu Gly Ser Arg Ala Arg Arg Ala Gly Val 
35 40 45 

Ile Ser Arg Glu Val Tyr Leu Leu Ile Leu Ser Val Thr Thr Leu Ser 
50 55 60 

Leu Leu Leu Ala Pro Val Leu Trp Arg Ala Ala Ile Thr Arg Cys Val 
65 70 75 80 

Pro Arg Pro Glu Arg Arg Ser Ser Leu Xaa 
85 90 



<210> 165 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 165 

Met Phe Val Trp His Leu Lys Val Met Val Met Phe Ile Ile Leu Tyr 
1 5 10 15 

Phe Ala Tyr Cys Glu Ser Asn Phe His Ser Val Leu Ser Val Ser Lys 
20 25 30 

Pro Leu Leu Lys Ile Leu Phe Leu Pro Arg Asn Leu Xaa 
35 40 45 



<210> 166 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 



<400> 166 

Met Thr Pro Gly Cys Ser Val Pro Phe Leu Leu Cys Trp Leu Phe Ala 
1 5 10 15 

Leu Met Met Gln Glu Lys Trp Gly Gly Val Lys Ser Leu Val Ser Tyr 
20 25 30 

His Tyr Ser Arg Gln Trp His Gln Thr Val Val Val Xaa 
35 40 45 



<210> 167 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 167 

Met Ser Ile Ala Leu Arg Ile Asn Arg Leu His Phe Trp Val Leu Leu 
1 5 10 15 

Phe Phe Phe Phe Phe Ala Gln Leu Ser Leu Ser Val Asp Leu His Gly 
20 25 30 

Thr Ser Tyr Ser Leu Lys Ser Leu Ser Tyr Leu Thr Ile Phe Leu Asp 
35 40 45 

Leu Glu Lys Leu Asp Val Gly Pro Tyr Glu Lys Ile Ile Arg Asn Gln 
50 55 60 

Ile Xaa 
65 



<210> 168 

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 168 

Met Gln Leu Thr Leu Gly Gly Ala Ala Val Gly Ala Gly Ala Val Leu 
1 5 10 15 

Ala Ala Ser Leu Leu Trp Ala Cys Ala Val Gly Leu Tyr Met Gly Gln 
20 25 30 

Leu Glu Leu Asp Val Glu Leu Val Pro Glu Asp Asp Gly Thr Ala Ser 
35 40 45 

Ala Glu Gly Pro Asp Glu Ala Gly Arg Pro Pro Pro Glu Xaa 
50 55 60 



<210> 169 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 169 

Met His Thr Ala Lys Met Ser Leu Leu Asn Ser Val Cys Leu Leu Val 
1 5 10 15 

Leu Ser Ile Trp Tyr Val Val Lys Phe Pro Met Met Arg Asp Ser Thr 
20 25 30 

Ile Asn Val Pro Tyr Leu Leu Arg Leu Lys Ala Ile Thr Thr Xaa 
35 40 45 



<210> 170 
<211> 106 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (69) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 
<400> 170 

Met Ser Gly Leu Ala Ala Ala Ala His Val Phe Arg Val Cys Leu Phe 
1 5 10 15 

Pro Leu Ser Trp Gly Ser Ser Lys Thr Thr Phe Ile His Gly Leu Ser 
20 25 30 

Ser Tyr Ile Ala Thr Pro Val Leu Asn Ser Ile Phe Ser Ser Trp Lys 
35 40 45 

Ser Arg Arg Lys Asp Thr Trp Thr Cys Leu Leu His Arg Leu Ser Ala 
50 55 60 

Phe Pro Ile Ser Xaa Arg Arg Arg Asn Phe Ala Leu Phe Ser His Ser 
65 70 75 80 

Cys Val Cys Ile Arg Ser Ser Ser Asp Asp Val Gly Pro Thr Met Tyr 
85 90 95 

Ser Phe Ser Val Pro Cys Arg Val Lys Xaa 
100 105 



<210> 171 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 171 

Met His Leu Leu Thr Leu Phe Ser Ser Gly Leu Ile Phe Leu Gly Cys 
1 5 10 15 

Ser Thr Pro Leu Ser Phe Cys Asp Cys Leu Pro Ile Leu Leu Leu Trp 
20 25 30 

Leu Glu Phe Pro Val Glu Thr Ser Gly Val Cys Ser Xaa 
35 40 45 



<210> 172 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 172 

Met Ile Leu Lys His Tyr Ile Leu Thr Phe Ile Phe Leu Phe Ile Phe 
1 5 10 15 

Leu Phe Phe Met Leu Asn Ile Leu His Ser Asn Ser Asn Leu Ile Asp 
20 25 30 

Leu Leu Lys Gly Asn Ile Arg Phe Arg Leu Leu Asn Ser Met Xaa 
35 40 45 



<210> 173 

<211> 42 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 



<400> 173 

Met Ala Thr Leu Gln Ile Thr Thr Ala Met Lys Ile Thr Met Met Ile 
1 5 10 15 

Thr Met Val Met Ile Ile Thr Thr Ile Val Glu Ala Met Lys Ile Pro 
20 25 30 

Thr Thr Ala Met Met Met Ala Met Gln Xaa 
35 40 



<210> 174 
<211> 47 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 



<400> 174 

Met Glu Met Leu Ser Ser Lys Trp Ser Lys Arg Val Ala Ala Ser Leu 
1 5 10 15 

Ala His Leu Ile Ser Leu Phe Ile Gly Leu Leu Phe Leu Leu Leu Gly 
20 25 30 

Ser Ser Val Tyr Pro Gly Thr Glu Thr Leu Phe Pro Lys Ser Xaa 
35 40 45 



<210> 175 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 175 

Met Trp Pro Ser Leu Gly Arg Cys Cys Leu Phe Phe Cys Leu Leu Thr 
1 5 10 15

Asn Leu Thr Ser Cys His Thr Ser Gln Ile Thr Leu Cys Ser Arg Glu
20 25 30

Thr Cys Val Trp Ser Arg Thr Thr Xaa
35 40



<210> 176 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 176 

Met Tyr Leu Met Ser Phe Ser Ile His Phe Val Lys Ile Ile Cys Met
1 5 10 15

Cys Thr Ile Leu Val Leu Ser Pro Pro Val Leu Leu Lys Tyr Gln Asp
20 25 30

Ser Thr Pro Arg Pro Leu Trp Ser Gln Cys Lys Ile Pro Ile Asn Tyr
35 40 45 

Leu Lys Gly Lys Xaa 
50 



<210> 177 

<211> 250 

<212> PRT 

<213> Homo sapiens

<220> 

<221> SITE 
<222> (250) 

<223> Xaa equals stop translation 
<400> 177 

Met Arg Gly Pro Ser Trp Ser Arg Pro Arg Pro Leu Leu Leu Leu Leu 
1 5 10 15

Leu Leu Leu Ser Pro Trp Pro Val Trp Ala Gln Val Ser Ala Arg Ala
20 25 30

Ser Pro Ser Gly Ser Leu Gly Ala Pro Asp Cys Pro Glu Val Cys Thr
35 40 45

Cys Val Pro Gly Gly Leu Pro Ala Val Gly Thr Leu Ala Ala Arg Arg
50 55 60

Ala Pro Gly Pro Glu Pro Ala Pro Ala Arg Ala Ala Ala Gly Pro Gln
65 70 75 80

Pro Arg Pro Cys Ala Ala Ala Arg Cys Leu Arg Gly Ser Gly Arg Ala
85 90 95

Thr Ala Pro Gly Pro Ala Arg Glu Arg Ala Ala Leu Gly Ala Cys Ala
100 105 110

Ser Leu Leu Gly Pro Gly Arg Ala Ala Ala Ala Gly Pro Glu Arg Gln
115 120 125

Pro Ala Gly Ser Thr Gly Thr Arg Asp Phe Arg Ala Ala Ala Arg Ala
130 135 140

Ala Gln Pro Leu Ile Gly Arg Gln Pro Ala Gly Ala Pro Gly Ala Arg
145 150 155 160

Gly Ala Arg Arg Ala Pro Ala Ala Ala Leu Thr Gln Pro Ala Gly Gln
165 170 175

Arg Ala Gly Gly Thr Arg Ala Gly Ala Ala Gly Pro Pro Ala Arg Ser
180 185 190

Arg Arg Ala Ala Pro Ala Arg Gln Pro Leu Gly Leu Arg Val Arg Ala
195 200 205

Ala Pro Ala Leu Arg Leu Ala Ala Pro Ala Pro Ala Ala Arg Val Arg
210 215 220

Gly Arg Asp Gly Ala Leu Arg Val Ala Gly Thr Pro Asp Ala Gln Pro
225 230 235 240 

Pro Asp Cys Leu Phe Arg Arg Arg Leu Xaa 
245 250 



<210> 178 
<211> 148 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (148) 

<223> Xaa equals stop translation
<400> 178 

Met Leu Ala Gly Ala Gly Arg Pro Gly Leu Pro Gln Gly Arg His Leu
1 5 10 15

Cys Trp Leu Leu Cys Ala Phe Thr Leu Lys Leu Cys Gln Ala Glu Ala
20 25 30

Pro Val Gln Glu Glu Lys Leu Ser Ala Ser Thr Ser Asn Leu Pro Cys
35 40 45

Trp Leu Val Glu Glu Phe Val Val Ala Glu Glu Cys Ser Pro Cys Ser
50 55 60

Asn Phe Arg Ala Lys Thr Thr Pro Glu Cys Gly Pro Thr Gly Tyr Val
65 70 75 80

Glu Lys Ile Thr Cys Ser Ser Ser Lys Arg Asn Glu Phe Lys Ser Cys
85 90 95

Arg Phe Ser Phe Glu Trp Asn Asn Ala Tyr Phe Gly Ser Ser Lys Gly
100 105 110

Ala Val Val Cys Val Ala Leu Ile Phe Ala Cys Leu Val Ile Ile Arg
115 120 125

Gln Arg Gln Leu Asp Arg Lys Ala Leu Glu Lys Val Arg Lys Gln Ile
130 135 140

Glu Ser Ile Xaa
145 



<210> 179 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 179 

Met Phe Met Cys Arg Leu Leu Leu Trp Ala Thr Gly Ala Tyr Gly Phe 
1 5 10 15

Leu Gly Asp Asp Val Glu Tyr Thr Ser Val Leu Pro His Gln Lys Gly
20 25 30

Lys Glu Ala Trp Val Phe Ile Cys Gln Leu Pro Phe Ile Ile Gly Xaa
35 40 45 



<210> 180 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (57)

<223> Xaa equals stop translation 
<400> 180 

Met Leu Gln Thr Leu Leu Cys Leu Trp Gln Tyr Thr Ser Ala Gln Val
1 5 10 15

Leu Lys Met Leu Cys Ile His Arg Gln Lys Trp Asp Asn Phe Trp Ala
20 25 30

Val Val Met Ile Asn Leu Leu Ile Arg Ile Gln Arg Leu Pro Phe Ser
35 40 45

Leu Pro Ile Ala Leu Arg Val Xaa Xaa
50 55 



<210> 181 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 
<400> 181 

Met Pro Ser Glu Gly Arg Leu Val Leu Leu Ser Ala Phe Cys Pro Ala 
1 5 10 15

Phe Phe Pro Pro Trp Val Leu Ser Gly Ser Phe Ala Phe Ser Leu Cys
20 25 30

Ala Glu Ser His Leu Asn Ser Ser His Arg Arg Ile Ala Val Trp Thr
35 40 45 

Xaa 



<210> 182 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 182 

Met Val Gln Trp Lys Asn Trp Pro Glu Ser Leu Glu Val Trp Val Leu
1 5 10 15

Val Leu Ala Val Pro Leu Thr His Cys Asp Leu Gly Ile Leu Cys Cys
20 25 30

Glu Asp Ile Ser Gln Val Leu His Val Ser Gln Gln Ile Xaa
35 40 45



<210> 183 
<211> 41 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 183 

Met Ala Leu Gly Leu Cys Ser Ser Gly Ala Leu Ser Thr Leu Cys Leu 
1 5 10 15

Ser Ser Val Thr Cys Leu Ala Ile Met Val Leu Met Ala Val Asp Gly
20 25 30 

Leu His Gly Thr Ser Gly Leu Gly Xaa 
35 40 



<210> 184 

<211> 80 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 184 

Met Thr Leu Met Cys Leu Cys Leu Ser Val Thr Val Leu His Pro Leu 
1 5 10 15

Arg Ser Lys Glu Arg Leu Ser Gly Thr Phe Cys Gly Tyr Ser Ser Ser
20 25 30

Trp Cys Ser Pro Ala Ser Glu Ser Ser Ser Pro Gly Ser Leu Leu Thr
35 40 45

Cys Ala Ala Ser Gly Ser His Pro Asp Cys Pro Leu Ser Gln Arg Leu
50 55 60

Leu Gly Val Gln Leu Ala Ala Leu Gly Arg Pro Gln Gly Leu Phe Xaa
65 70 75 80

<210> 185
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 185 

Met Lys Ser Gln Cys Tyr Ser Pro Ser Tyr Phe Ala Phe Phe Cys Leu
1 5 10 15

Val Phe Phe Gln Ile Thr Ser Ala Ser Ser Gln Thr Leu Arg Gly His
20 25 30 

Val Leu Cys Arg Thr Thr Leu Arg Asp Ser Ser Ala Tyr Cys Xaa 
35 40 45 



<210> 186 
<211> 141 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (36) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (141) 

<223> Xaa equals stop translation 
<400> 186 

Met Phe Leu Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val 
1 5 10 15

Ser Leu Val Phe Leu Gly Gln Ala Phe Thr Ile Met Leu Val Tyr Val
20 25 30

Trp Ser Arg Xaa Asn Pro Tyr Val Arg Met Asn Phe Phe Gly Leu Leu
35 40 45

Asn Phe Gln Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu
50 55 60

Leu Leu Gly Asn Ser Ile Ile Val Asp Leu Leu Gly Ile Ala Val Gly
65 70 75 80

His Ile Tyr Phe Phe Leu Glu Asp Val Phe Pro Asn Gln Pro Gly Gly
85 90 95

Ile Arg Ile Leu Lys Thr Pro Ser Ile Leu Lys Ala Ile Phe Asp Thr
100 105 110

Pro Asp Glu Asp Pro Asn Tyr Asn Pro Leu Pro Glu Glu Arg Pro Gly
115 120 125

Gly Phe Ala Trp Gly Glu Gly Gln Arg Leu Gly Gly Xaa
130 135 140 



<210> 187 
<211> 339 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (339) 

<223> Xaa equals stop translation 
<400> 187 

Met Arg Lys Pro Ala Ala Gly Phe Leu Pro Ser Leu Leu Lys Val Leu 
1 5 10 15

Leu Leu Pro Leu Ala Pro Ala Ala Ala Gln Asp Ser Thr Gln Ala Ser
20 25 30

Thr Pro Gly Ser Pro Leu Ser Pro Thr Glu Tyr Glu Arg Phe Phe Ala
35 40 45

Leu Leu Thr Pro Thr Trp Lys Ala Glu Thr Thr Cys Arg Leu Arg Ala
50 55 60

Thr His Gly Cys Arg Asn Pro Thr Leu Val Gln Leu Asp Gln Tyr Glu
65 70 75 80

Asn His Gly Leu Val Pro Asp Gly Ala Val Cys Ser Asn Leu Pro Tyr
85 90 95

Ala Ser Trp Phe Glu Ser Phe Cys Gln Phe Thr His Tyr Arg Cys Ser
100 105 110

Asn His Val Tyr Tyr Ala Lys Arg Val Leu Cys Ser Gln Pro Val Ser
115 120 125

Ile Leu Ser Pro Asn Thr Leu Lys Glu Ile Glu Ala Ser Ala Glu Val
130 135 140

Ser Pro Thr Thr Met Thr Ser Pro Ile Ser Pro His Phe Thr Val Thr
145 150 155 160

Glu Arg Gln Thr Phe Gln Pro Trp Pro Glu Arg Leu Ser Asn Asn Val
165 170 175

Glu Glu Leu Leu Gln Ser Ser Leu Ser Leu Gly Ser Gln Glu Gln Ala
180 185 190

Pro Glu His Lys Gln Glu Gln Gly Val Glu His Arg Gln Glu Pro Thr
195 200 205

Gln Glu His Lys Gln Glu Glu Gly Gln Lys Gln Glu Glu Gln Glu Glu
210 215 220

Glu Gln Glu Glu Glu Gly Lys Gln Glu Glu Gly Gln Gly Thr Lys Glu
225 230 235 240

Gly Arg Glu Ala Val Ser Gln Leu Gln Thr Asp Ser Glu Pro Lys Phe
245 250 255

His Ser Glu Ser Leu Ser Ser Asn Pro Ser Ser Phe Ala Pro Arg Val
260 265 270

Arg Glu Val Glu Ser Thr Pro Met Ile Met Glu Asn Ile Gln Glu Leu
275 280 285

Ile Arg Ser Ala Gln Glu Ile Asp Glu Met Asn Glu Ile Tyr Asp Glu
290 295 300

Asn Ser Tyr Trp Arg Asn Gln Asn Pro Gly Ser Leu Leu Gln Leu Pro
305 310 315 320

His Thr Glu Pro Cys Trp Cys Cys Ala Ile Arg Ser Trp Arg Ile Pro
325 330 335 

Ala Ser Xaa 



<210> 188 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 188 

Met Gln Arg Ile Pro Thr Ser Pro Arg Gln Ala Trp Trp Trp Thr Cys
1 5 10 15

Trp Ala Met Phe Gln Gly Pro Ala Ala Gly Ser Val Gly Ala Glu Arg
20 25 30

Lys Gly Glu Gly Cys Leu Phe Phe Gly Gln Asp Glu Ser Ser Arg Cys
35 40 45

Gly Arg Ser Trp Pro Leu Ala Asp Pro Trp Val Tyr Arg Val Leu Arg
50 55 60 

Ser Xaa 
65 



<210> 189 
<211> 360 
<212> PRT 

<213> Homo sapiens 
<400> 189 

Met Val Pro Ala Ala Gly Arg Arg Pro Pro Arg Val Met Arg Leu Leu 
1 5 10 15

Gly Trp Trp Gln Val Leu Leu Trp Val Leu Gly Leu Pro Val Arg Gly
20 25 30

Val Glu Val Ala Glu Glu Ser Gly Arg Leu Trp Ser Glu Glu Gln Pro
35 40 45

Ala His Pro Leu Gln Val Gly Ala Val Tyr Leu Gly Glu Glu Glu Leu
50 55 60

Leu His Asp Pro Met Gly Gln Asp Arg Ala Ala Glu Glu Ala Asn Ala
65 70 75 80

Val Leu Gly Leu Asp Thr Gln Gly Asp His Met Val Met Leu Ser Val
85 90 95

Ile Pro Gly Glu Ala Glu Asp Lys Val Ser Ser Glu Pro Ser Gly Val
100 105 110

Thr Cys Gly Ala Gly Gly Ala Glu Asp Ser Arg Cys Asn Val Arg Glu
115 120 125

Ser Leu Phe Ser Leu Asp Gly Ala Gly Ala His Phe Pro Asp Arg Glu
130 135 140

Glu Glu Tyr Tyr Thr Glu Pro Glu Val Ala Glu Ser Asp Ala Ala Pro
145 150 155 160

Thr Glu Asp Ser Asn Asn Thr Glu Ser Leu Lys Ser Pro Lys Val Asn
165 170 175

Cys Glu Glu Arg Asn Ile Thr Gly Leu Glu Asn Phe Thr Leu Lys Ile
180 185 190

Leu Asn Met Ser Gln Asp Leu Met Asp Phe Leu Asn Pro Asn Gly Ser
195 200 205

Asp Cys Thr Leu Val Leu Phe Tyr Thr Pro Trp Cys Arg Phe Ser Ala
210 215 220

Ser Leu Ala Pro His Phe Asn Ser Leu Pro Arg Ala Phe Pro Ala Leu
225 230 235 240

His Phe Leu Ala Leu Asp Ala Ser Gln His Ser Ser Leu Ser Thr Arg
245 250 255

Phe Gly Thr Val Ala Val Pro Asn Ile Leu Leu Phe Gln Gly Ala Lys
260 265 270

Pro Met Ala Arg Phe Asn His Thr Asp Arg Thr Leu Glu Thr Leu Lys
275 280 285

Ile Phe Ile Phe Asn Gln Thr Gly Ile Glu Ala Lys Lys Asn Val Val
290 295 300

Val Thr Gln Ala Asp Gln Ile Gly Pro Leu Pro Ser Thr Leu Ile Lys
305 310 315 320

Ser Val Asp Trp Leu Leu Val Phe Ser Leu Phe Phe Leu Ile Ser Phe
325 330 335

Ile Met Tyr Ala Thr Ile Arg Thr Glu Ser Ile Arg Trp Leu Ile Pro
340 345 350

Gly Gln Glu Gln Glu His Val Glu
355 360 



<210> 190 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (160) 

<223> Xaa equals stop translation 
<400> 190 

Met Leu Leu Leu Leu Ile Phe Trp Ile Ala Pro Ala His Gly Pro Thr
1 5 10 15

Asn Ile Met Val Tyr Ile Ser Ile Cys Ser Leu Leu Gly Ser Phe Thr
20 25 30

Val Pro Ser Thr Lys Gly Ile Gly Leu Ala Ala Gln Asp Ile Leu His
35 40 45

Asn Asn Pro Ser Ser Gln Arg Ala Leu Cys Leu Cys Leu Val Leu Leu
50 55 60

Ala Val Leu Gly Cys Ser Ile Ile Val Gln Phe Arg Tyr Ile Asn Lys
65 70 75 80

Ala Leu Glu Cys Phe Asp Ser Ser Val Phe Gly Ala Ile Tyr Tyr Val
85 90 95

Val Phe Thr Thr Leu Val Leu Leu Ala Ser Ala Ile Leu Phe Arg Glu
100 105 110

Trp Ser Asn Val Gly Leu Val Asp Phe Leu Gly Met Ala Cys Gly Phe
115 120 125

Thr Thr Val Ser Val Gly Ile Val Leu Ile Gln Val Phe Lys Glu Phe
130 135 140

Asn Phe Asn Leu Gly Glu Met Asn Lys Ser Asn Met Lys Thr Asp Xaa
145 150 155 160



<210> 191 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (96) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (101) 

<223> Xaa equals stop translation 
<400> 191 

Met Phe Val Ala Val Phe Tyr Trp Val Leu Thr Val Phe Phe Leu Ile
1 5 10 15

Ile Tyr Ile Thr Met Thr Tyr Thr Arg Ile Pro Gln Val Pro Trp Thr
20 25 30

Thr Val Gly Leu Cys Phe Asn Gly Ser Ala Phe Val Leu Tyr Leu Ser
35 40 45

Ala Ala Val Val Asp Ala Ser Ser Val Ser Pro Glu Lys Asp Ser His
50 55 60

Asn Phe Asn Ser Trp Ala Ala Ser Ser Phe Phe Ala Phe Leu Val Thr
65 70 75 80

Ile Cys Tyr Ala Gly Asn Thr Tyr Phe Ser Phe Xaa Ala Trp Arg Xaa
85 90 95

Arg Thr Ile Gln Xaa
100 



<210> 192 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 192 

Met Phe Lys Leu Gln Leu Asp Leu Leu Thr Ala Val Asn Leu Val Tyr
1 5 10 15

Phe Ser Phe Leu Trp Val Val Ser Val Ala Asn Lys Met Asp Val Ser 
20 25 30 

Val Phe Glu Leu Val Asn Ser Asp Cys Phe Xaa 
35 40 



<210> 193 

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 193 

Met Ser Val Cys Val Phe Leu Asp Phe Arg Leu Ile Phe Trp Ser Phe
1 5 10 15

Cys Pro Cys Ser Ala Ser Pro Ser Arg His Phe Ala Ser Ser Ser Arg 
20 25 30 

Gly Gly Gly Gly Gly Ser Arg Asn Trp Val Gly Ala Gly Ala Ser Leu 
35 40 45 

Ala Ala Ser Leu Ala Leu Tyr Ala Leu Ser Pro Arg Arg Xaa 
50 55 60 



<210> 194 
<211> 53 
<212> PRT 
<213> Homo sapiens
<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 194 

Met Gln Ala Gln Ile Ser Ser Pro Arg Trp Thr Ser Trp Phe Ser Leu
1 5 10 15

Thr Ala Val Thr Leu Ala Phe Pro Ser Leu Ile Pro Tyr Pro Ser Cys
20 25 30

Gly Ile Pro Val Leu Thr Gln Asp Ala Lys Trp Pro Ser Asp Tyr Thr
35 40 45 

Ser Pro Asp Ser Xaa 
50 



<210> 195 
<211> 186 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (186) 

<223> Xaa equals stop translation 
<400> 195 

Met Thr Leu Leu Asn Leu Leu Leu Gln Thr Ile Phe Tyr Gly Val Thr
1 5 10 15

Cys Leu Asp Asp Val Leu Lys Arg Thr Lys Gly Gly Lys Asp Ile Lys
20 25 30

Phe Leu Thr Ala Phe Arg Asp Leu Leu Phe Thr Thr Leu Ala Phe Pro
35 40 45

Val Ser Thr Phe Val Phe Leu Ala Phe Trp Ile Leu Phe Leu Tyr Asn
50 55 60

Arg Asp Leu Ile Tyr Pro Lys Val Leu Asp Thr Val Ile Pro Val Trp
65 70 75 80

Leu Asn His Ala Met His Thr Phe Ile Phe Pro Ile Thr Leu Ala Glu
85 90 95

Val Val Leu Arg Pro His Ser Tyr Pro Ser Lys Lys Thr Gly Leu Thr
100 105 110

Leu Xaa Ala Ala Ala Ser Ile Ala Tyr Ile Ser Arg Ile Leu Trp Leu
115 120 125

Tyr Phe Glu Thr Gly Thr Trp Val Tyr Pro Val Phe Ala Lys Leu Ser
130 135 140

Leu Leu Gly Leu Ala Ala Phe Phe Ser Leu Ser Tyr Val Phe Ile Ala
145 150 155 160

Ser Ile Tyr Leu Leu Gly Glu Lys Leu Asn His Trp Lys Trp Gly Asp
165 170 175

Met Arg Gln Pro Arg Lys Lys Arg Lys Xaa
180 185



<210> 196 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals stop translation 
<400> 196 

Met Lys Asn Ala Thr Leu Leu Arg Met Val Leu Phe Val Ile Asn Leu
1 5 10 15

Gln Asn Leu Lys Ser Cys Pro Val Leu His Ile His Gln Asp Val Gln
20 25 30

Gln Gln Lys Arg Met Gly His Gly Gly Ser Ser Thr Arg Val Thr Val
35 40 45

Thr Ser Leu Ile Arg His Cys Thr Val Phe Gln Arg Pro Lys Asn Cys
50 55 60

Val Gln Asn Met Ile Thr Leu Gln Leu Ser Phe Pro Xaa
65 70 75 



<210> 197 
<211> 175 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (175) 

<223> Xaa equals stop translation 
<400> 197

Met Phe Val Pro Ser Cys Leu Cys Leu Arg Phe Val Val Thr Ser Leu
1 5 10 15

Leu Leu Gln Met Thr His Ser Cys Gly Gly Phe Tyr Ile Cys Val Ile
20 25 30

Phe Glu Thr Ile Leu Ser Glu Phe Lys Thr Gln Ile Gly Arg Leu Tyr
35 40 45

Arg Lys Arg His Ile Gln Arg Lys Glu Ser Pro Lys Gly Arg Phe Val
50 55 60

Met Leu Leu Pro Ser Ser Thr His Thr Ile Pro Phe Tyr Pro Asn Pro
65 70 75 80

Leu His Pro Arg Pro Phe Pro Ser Ser Arg Leu Pro Pro Gly Ile Ile
85 90 95

Gly Gly Glu Tyr Asp Gln Arg Pro Thr Leu Pro Tyr Val Gly Asp Pro
100 105 110

Ile Ser Ser Leu Ile Pro Gly Pro Gly Glu Thr Pro Ser Gln Phe Pro
115 120 125

Pro Leu Arg Pro Arg Phe Asp Pro Val Gly Pro Leu Pro Gly Pro Asn
130 135 140

Pro Ile Leu Pro Gly Arg Gly Gly Pro Asn Asp Arg Phe Pro Phe Arg
145 150 155 160

Pro Ser Arg Gly Arg Pro Thr Asp Gly Arg Leu Ser Phe Met Xaa
165 170 175 



<210> 198 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 198 

Met Gly Leu Lys Arg Lys Gln Gly Phe Val Phe Leu Phe Leu Leu Leu
1 5 10 15

Lys Ser Thr Val Ala Ser Trp Leu Leu Ser Gly Val Gly Arg Ile Trp
20 25 30

Gly Leu Val His Phe Val Lys Val Asn His Val Cys Leu Asn Asn Arg
35 40 45

Gly Val Xaa
50 



<210> 199 
<211> 190 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (190) 

<223> Xaa equals stop translation 
<400> 199 

Met Gly Pro Val Arg Leu Gly Ile Leu Leu Phe Leu Phe Leu Ala Val
1 5 10 15

His Glu Ala Trp Ala Gly Met Leu Lys Glu Glu Asp Asp Asp Thr Glu
20 25 30

Arg Leu Pro Ser Lys Cys Glu Val Cys Lys Leu Leu Ser Thr Glu Leu
35 40 45

Gln Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu Leu
50 55 60

Gly Gln Val Leu Asp Thr Gly Lys Arg Lys Arg His Val Pro Tyr Ser
65 70 75 80

Val Ser Glu Thr Arg Leu Glu Glu Ala Leu Glu Asn Leu Cys Glu Arg
85 90 95

Ile Leu Asp Tyr Ser Val His Ala Glu Arg Lys Gly Ser Leu Arg Tyr
100 105 110

Ala Lys Gly Gln Ser Gln Thr Met Ala Thr Leu Lys Gly Leu Val Gln
115 120 125

Lys Gly Val Lys Val Asp Leu Gly Ile Pro Leu Glu Leu Trp Asp Glu
130 135 140

Pro Ser Val Glu Val Thr Tyr Leu Lys Lys Gln Cys Glu Thr Met Leu
145 150 155 160 

Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Gly Asp Lys Met Thr 
165 170 175 

Lys Thr Gly Ser His Pro Lys Leu Asp Arg Glu Asp Leu Xaa 
180 185 190 



<210> 200 
<211> 80 
<212> PRT

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 200 

Met Asn Tyr Ser Arg Ser Pro Trp Ala Ala Val Met Glu Pro Leu Thr 
1 5 10 15 

Leu Leu Phe Leu His Leu Ser Cys Leu Leu Ser Leu Cys Glu Ala Val 
20 25 30 

Gly Trp Asp Ser Glu Cys Leu Val Cys Ser Leu Gly Glu Glu Glu Phe 
35 40 45

Leu Arg Met Gln Ala Leu Leu Cys Gly Cys Arg Leu His Leu Gly Gly
50 55 60

Val Leu Tyr Val Cys Thr Leu Gly Thr Ala Cys Ile Trp Lys Ile Xaa
65 70 75 80 



<210> 201 
<211> 106 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 
<400> 201 

Met Asn Leu Gly Val Ser Met Leu Arg Ile Leu Phe Leu Leu Asp Val
1 5 10 15

Gly Gly Ala Gln Val Leu Ala Thr Gly Lys Thr Pro Gly Ala Glu Ile
20 25 30

Asp Phe Lys Tyr Ala Leu Ile Gly Thr Ala Val Gly Val Ala Ile Ser
35 40 45

Ala Gly Phe Leu Ala Leu Lys Ile Cys Met Ile Arg Arg His Leu Phe
50 55 60

Asp Asp Asp Ser Ser Asp Leu Lys Ser Thr Pro Gly Gly Leu Ser Asp
65 70 75 80

Thr Ile Pro Leu Lys Lys Arg Ala Pro Arg Arg Asn His Asn Phe Ser
85 90 95

Lys Arg Asp Ala Gln Val Ile Glu Leu Xaa
100 105



<210> 202 

<211> 80 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 202 

Met Ala Cys Leu Gly Gly Leu Leu Gly Ile Ile Gly Val Ile Cys Leu
1 5 10 15

Ile Ser Cys Leu Ser Pro Glu Met Asn Cys Asp Gly Gly His Ser Tyr
20 25 30

Val Arg Asn Tyr Leu Gln Lys Pro Thr Phe Ala Leu Gly Glu Leu Tyr
35 40 45

Pro Pro Leu Ile Asn Leu Trp Glu Ala Gly Lys Glu Lys Ser Thr Ser
50 55 60

Leu Lys Val Lys Ala Thr Val Ile Gly Leu Pro Thr Asn Met Ser Xaa
65 70 75 80 



<210> 203 

<211> 58 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 203 

Met Gly Leu Lys Leu Leu Gln Lys Pro Gly Ser Leu Lys Thr Leu Ile
1 5 10 15

Ala Ile Ile Leu Val Met Tyr Ile Phe Met Thr Ile Ser Val Ile Ala
20 25 30

Gly Thr Gly Lys Phe Ser Gln Lys Leu Asp Leu His Leu Asn Met Asp
35 40 45

Ile Ser Pro Gly Arg Pro Ser Val His Xaa
50 55 



<210> 204 
<211> 161 
<212> PRT 

<213> Homo sapiens 
<400> 204 

Met Asp Phe Leu Asn Pro Asn Gly Ser Asp Cys Thr Leu Val Leu Phe 
1 5 10 15

Tyr Thr Pro Trp Cys Arg Phe Ser Ala Ser Leu Ala Pro His Phe Asn
20 25 30

Ser Leu Pro Arg Ala Phe Pro Ala Leu His Phe Leu Ala Leu Asp Ala
35 40 45

Ser Gln His Ser Ser Leu Ser Thr Arg Phe Gly Thr Val Ala Val Pro
50 55 60

Asn Ile Leu Leu Phe Gln Gly Ala Lys Pro Met Ala Arg Phe Asn His
65 70 75 80

Thr Asp Arg Thr Leu Glu Thr Leu Lys Ile Phe Ile Phe Asn Gln Thr
85 90 95

Gly Ile Glu Ala Lys Lys Asn Val Val Val Thr Gln Ala Asp Gln Ile
100 105 110

Gly Pro Leu Pro Ser Thr Leu Ile Lys Ser Val Asp Trp Leu Leu Val
115 120 125

Phe Ser Leu Phe Phe Leu Ile Ser Phe Ile Met Tyr Ala Thr Ile Arg
130 135 140

Thr Glu Ser Ile Arg Trp Leu Ile Pro Gly Gln Glu Gln Glu His Val
145 150 155 160 

Glu 



<210> 205 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 205

Ile Pro Glu Asn Arg Arg Pro Ala Ser Xaa Cys Thr Trp Ser Met Trp
1 5 10 15

Thr Ser Arg Thr Thr Thr Arg Arg Pro Pro Trp Gly Arg Phe Ser Ser 
20 25 30 

Val Ser Ser Ala Ser Val Ser Ser Thr Arg Lys Thr Trp Arg Thr Arg 
35 40 45 

Ser Thr Ser Cys Cys Arg Ser Ser Arg Arg Arg Val Ala Ala Pro Phe 
50 55 60 

Cys Thr Pro Ser Ala Ser Thr Glu Pro Ser Ala Arg Met Glu Pro Pro 
65 70 75 80 

Leu Glu Leu Pro Val Val His Thr Phe Ser Phe Leu Thr Phe Val Phe 
85 90 95 

Thr Tyr Arg Cys Ser Ala Gly Asp Gly Ser Ile Thr Gln Ile Asn Cys
100 105 110

Ala Tyr Glu Met Gly Glu Glu Met Pro Lys Arg Gln Met Lys Ala Ile
115 120 125 

Lys Phe Leu Leu Phe His Phe Tyr Leu 
130 135 



<210> 206 

<211> 41 

<212> PRT 

<213> Homo sapiens

<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 206 

Ile Pro Glu Asn Arg Arg Pro Ala Ser Xaa Cys Thr Trp Ser Met Trp
1 5 10 15

Thr Ser Arg Thr Thr Thr Arg Arg Pro Pro Trp Gly Arg Phe Ser Ser 
20 25 30 

Val Ser Ser Ala Ser Val Ser Ser Thr 
35 40 



<210> 207 

<211> 43 

<212> PRT 

<213> Homo sapiens 
<400> 207

Arg Lys Thr Trp Arg Thr Arg Ser Thr Ser Cys Cys Arg Ser Ser Arg
1 5 10 15

Arg Arg Val Ala Ala Pro Phe Cys Thr Pro Ser Ala Ser Thr Glu Pro
20 25 30

Ser Ala Arg Met Glu Pro Pro Leu Glu Leu Pro
35 40



<210> 208 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<400> 208 

Val Val His Thr Phe Ser Phe Leu Thr Phe Val Phe Thr Tyr Arg Cys
1 5 10 15

Ser Ala Gly Asp Gly Ser Ile Thr Gln Ile Asn Cys Ala Tyr Glu Met
20 25 30

Gly Glu Glu Met Pro Lys Arg Gln Met Lys Ala Ile Lys Phe Leu Leu
35 40 45 

Phe His Phe Tyr Leu 
50 



<210> 209 
<211> 223 
<212> PRT 
<213> Homo sapiens 

<400> 209 

His Pro Ser Ile Ile Ile Trp Ser Gly Asn Asn Glu Asn Glu Glu Ala
1 5 10 15

Leu Met Met Asn Trp Tyr His Ile Ser Phe Thr Asp Arg Pro Ile Tyr
20 25 30

Ile Lys Asp Tyr Val Thr Leu Tyr Val Lys Asn Ile Arg Glu Leu Val
35 40 45

Leu Ala Gly Asp Lys Ser Arg Pro Phe Ile Thr Ser Ser Pro Thr Asn
50 55 60

Gly Ala Glu Thr Val Ala Glu Ala Trp Val Ser Gln Asn Pro Asn Ser
65 70 75 80

Asn Tyr Phe Gly Asp Val His Phe Tyr Asp Tyr Ile Ser Asp Cys Trp
85 90 95

Asn Trp Lys Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly Tyr
100 105 110

Gln Ser Trp Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr Glu
115 120 125

Asp Trp Ser Phe Asn Ser Lys Phe Ser Leu His Arg Gln His His Glu
130 135 140

Gly Gly Asn Lys Gln Met Leu Tyr Gln Ala Gly Leu His Phe Lys Leu
145 150 155 160

Pro Gln Ser Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr Ile Tyr Leu
165 170 175

Thr Gln Val Met Gln Ala Gln Cys Val Lys Thr Glu Thr Glu Phe Tyr
180 185 190

Arg Arg Ser Arg Ser Glu Ile Val Asp Gln Gln Gly His Thr Met Gly
195 200 205

Ala Leu Tyr Trp Gln Leu Asn Asp Ile Trp Gln Ala Pro Ser Trp
210 215 220 



<210> 210 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<400> 210 

Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys Ser Arg Val 
1 5 10 15 

Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys Leu Tyr Glu 
20 25 30 

Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys Thr Arg Glu 
35 40 45 

Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu Leu Leu Ser 
50 55 60 

Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala Val Gly Leu 
65 70 75 80 

Cys Lys Ala Gln Ile Thr Ala Ile Ile Ser Gln Gln Gly Asp Ile Phe
85 90 95 

Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val Trp Leu Asp 
100 105 110 

Val Gly Ser Ile Pro Gly Arg Phe Ser Asp Asn Gly Phe Leu Met Thr
115 120 125

Glu Lys Thr Arg Thr Ile Leu Phe Tyr Pro Trp Glu Pro Thr Ser Lys
130 135 140

Asn Glu Leu Glu Gln Ser Phe His Val Thr Ser Leu Thr Asp Ile Tyr
145 150 155 160



<210> 211 
<211> 171 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (102) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 211 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
1 5 10 15

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro
20 25 30

Ser Leu Pro Gln Val Thr Leu His Pro Thr Asp Gly Leu Met Ala Val
35 40 45

Leu Tyr Thr Gly Gly Glu Gly Arg Thr Leu Gly Glu Gln His Phe Phe
50 55 60

His Glu Thr Phe Val Thr Arg Trp Leu Leu Gly Pro Val Pro Val Arg
65 70 75 80

Phe Gly Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gln
85 90 95

Gly Ala Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg
100 105 110

Gln Leu Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro Tyr Ser Asn Ser
115 120 125

Ala Pro Leu Cys Thr Gly Met Gly His Ser Asn Thr Pro Gln Gly Pro
130 135 140

Pro Ser Pro Gln Tyr Ala Leu Ser Pro Thr Glu Pro Thr Ser Leu Ser
145 150 155 160

Gly Asn Ser His Leu Pro Ala Ile Leu Val Leu
165 170 



<210> 212 
<211> 41

<212> PRT 

<213> Homo sapiens 

<400> 212 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
1 5 10 15 

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro 
20 25 30 

Ser Leu Pro Gln Val Thr Leu His Pro
35 40 



<210> 213 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 213 

Thr Asp Gly Leu Met Ala Val Leu Tyr Thr Gly Gly Glu Gly Arg Thr 
1 5 10 15 

Leu Gly Glu Gln His Phe Phe His Glu Thr Phe Val Thr Arg Trp Leu
20 25 30 

Leu Gly Pro Val Pro Val Arg Phe Gly 
35 40 



<210> 214 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (20) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 214 

Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gln Gly Ala
1 5 10 15

Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg Gln Leu
20 25 30 

Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro 
35 40 



<210> 215 
<211> 47 
<212> PRT 
<213> Homo sapiens



<400> 215 

Tyr Ser Asn Ser Ala Pro Leu Cys Thr Gly Met Gly His Ser Asn Thr
1 5 10 15

Pro Gln Gly Pro Pro Ser Pro Gln Tyr Ala Leu Ser Pro Thr Glu Pro
20 25 30

Thr Ser Leu Ser Gly Asn Ser His Leu Pro Ala Ile Leu Val Leu
35 40 45



<210> 216 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 216 

His Leu Leu Glu Val Thr Pro Cys Arg Leu Pro Val Pro Glu Phe Pro
1 5 10 15

Gly Arg Thr Pro Arg Gly Ser Arg Thr Pro Asp
20 25



<210> 217 
<211> 239 
<212> PRT 
<213> Homo sapiens 



<400> 217 
Met Ile Pro Gly Ser Asp Ser Gln Thr Ala Leu Asn Phe Gly Ser Thr
1 5 10 15

Leu Met Lys Lys Lys Ser Asp Pro Glu Gly Pro Ala Leu Leu Phe Pro
20 25 30

Glu Ser Glu Leu Ser Ile Arg Ile Gly Arg Ala Gly Leu Leu Ser Asp
35 40 45

Lys Ser Glu Asn Gly Glu Ala Tyr Gln Arg Lys Lys Ala Ala Ala Thr
50 55 60

Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro Ser Arg Gly Asn Leu
65 70 75 80

Ala Gln Pro Gly Gly Ser Ser Trp Arg Arg Ile Ala Leu Leu Ile Leu
85 90 95

Ala Ile Thr Ile His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly
100 105 110

Phe Gly Ala Ile Glu Lys Thr Ala Ser Ala Thr Phe Glu Ser Ala Arg
115 120 125

Asn Leu Ala Ile Gly Ile Gly Ile Gln Asn Phe Pro Glu Gly Leu Ala
130 135 140

Val Ser Leu Pro Leu Arg Gly Ala Gly Phe Ser Thr Trp Arg Ala Phe
145 150 155 160

Trp Tyr Gly Gln Leu Ser Gly Met Val Glu Pro Leu Ala Gly Val Phe
165 170 175

Gly Ala Phe Ala Val Val Leu Ala Glu Pro Ile Leu Pro Tyr Ala Leu
180 185 190

Ala Phe Ala Ala Gly Ala Met Val Tyr Val Val Met Asp Asp Ile Ile
195 200 205

Pro Glu Ala Gln Ile Ser Gly Asn Gly Lys Leu Ala Ser Trp Ala Ser
210 215 220

Ile Leu Gly Phe Val Val Met Met Ser Leu Asp Val Gly Leu Gly
225 230 235



<210> 218 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 218 

Met Ile Pro Gly Ser Asp Ser Gln Thr Ala Leu Asn Phe Gly Ser Thr
1 5 10 15

Leu Met Lys Lys Lys Ser Asp Pro Glu Gly Pro Ala Leu Leu Phe Pro
20 25 30

Glu Ser Glu Leu Ser Ile Arg Ile Gly Arg Ala
35 40 



<210> 219 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 219 

Gly Leu Leu Ser Asp Lys Ser Glu Asn Gly Glu Ala Tyr Gln Arg Lys
1 5 10 15 

Lys Ala Ala Ala Thr Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro 
20 25 30 



Ser Arg Gly Asn Leu Ala Gln Pro Gly
35 40

<210> 220

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 220 

Gly Ser Ser Trp Arg Arg Ile Ala Leu Leu Ile Leu Ala Ile Thr Ile
1 5 10 15

His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly Phe Gly Ala Ile
20 25 30

Glu Lys Thr Ala Ser Ala Thr Phe Glu Ser Ala Arg
35 40 

<210> 221 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 221 

Asn Leu Ala Ile Gly Ile Gly Ile Gln Asn Phe Pro Glu Gly Leu Ala
1 5 10 15

Val Ser Leu Pro Leu Arg Gly Ala Gly Phe Ser Thr Trp Arg Ala Phe
20 25 30

Trp Tyr Gly Gln Leu Ser Gly Met Val Glu Pro
35 40 

<210> 222 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 222 

Leu Ala Gly Val Phe Gly Ala Phe Ala Val Val Leu Ala Glu Pro Ile
1 5 10 15

Leu Pro Tyr Ala Leu Ala Phe Ala Ala Gly Ala Met Val Tyr Val Val
20 25 30

Met Asp Asp Ile Ile Pro Glu Ala Gln Ile Ser
35 40 

<210> 223 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 223 

Gly Asn Gly Lys Leu Ala Ser Trp Ala Ser Ile Leu Gly Phe Val Val
1 5 10 15



Met Met Ser Leu Asp Val Gly Leu Gly 
20 25 



<210> 224 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 224 

Thr Arg Pro Ile Thr Tyr Val Leu Leu Ala Gly
1 5 10



<210> 225 

<211> 35 

<212> PRT 

<213> Homo sapiens 

<400> 225 

Gly Thr Ser Leu Thr Ala Pro Leu Leu Glu Phe Leu Leu Ala Leu Tyr 
1 5 10 15

Phe Leu Phe Ala Asp Ala Met Gln Leu Asn Asp Lys Trp Gln Gly Leu
20 25 30 

Cys Trp Pro 
35 



<210> 226 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 226 

Leu Ala Asn Phe Glx Cys Ser Asp Cys Ala Gln Thr Val Leu Phe Val
1 5 10 15

Leu Glx Phe Glx Ile Leu Val Phe Thr Tyr Glu Ile Pro Phe
20 25 30 



<210> 227 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 227 

Gln Ala Trp His Glu Val Gly Gly Gly Val Arg Arg Cys Trp Phe Val
1 5 10 15

Leu Gly Glu Arg Arg Ala Gly Ser Leu Leu Ser Ala Ser Tyr Gly Thr
20 25 30

Phe Ala Met Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala Ile Ala
35 40 45

Ser Asp Asp Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val
50 55 60

Leu Trp Trp Ile Gly Ile Leu Thr Leu Tyr Leu
65 70 75



<210> 228 
<211> 125 
<212> PRT 

<213> Homo sapiens 



<400> 228 

Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala Ile Ala Ser Asp Asp
1 5 10 15

Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val Leu Trp Trp
20 25 30

Ile Gly Ile Leu Thr Leu Tyr Leu Met His Arg Gly Lys Leu Asp Cys
35 40 45

Ala Gly Gly Ala Leu Leu Ser Ser Tyr Leu Ile Val Leu Met Ile Leu
50 55 60

Leu Ala Val Val Ile Cys Thr Val Ser Ala Ile Met Cys Val Ser Met
65 70 75 80

Arg Gly Thr Ile Cys Asn Pro Gly Pro Arg Lys Ser Met Ser Lys Leu
85 90 95

Leu Tyr Ile Arg Leu Ala Leu Phe Phe Pro Glu Met Val Trp Ala Ser
100 105 110

Leu Gly Ala Ala Trp Val Ala Asp Gly Val Gln Cys Asp
115 120 125



<210> 229 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 229 

His Glu Arg Asn Cys Phe Pro Met Trp Leu Asn His Ser Ala Phe Pro 
1 5 10 15

Pro Val

<210> 230
<211> 132 
<212> PRT 
<213> Homo sapiens 

<400> 230 

Gly Trp Thr Arg Glu Asn Asp His Arg Ala Leu Ser Lys Ala Gly Ile
1 5 10 15

Gly Ser Ala Glu Ile Gln Pro Ser Asn Leu Arg Val Gly Ser Ala Lys
20 25 30

Asp Leu Gly Lys Pro Trp Ala Gly Lys Leu Leu Leu Leu Ser Ser Cys
35 40 45

Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gln Met Leu Ala
50 55 60

Pro Pro Leu Gln Glu Asp Trp Lys Gly Gly Val Lys Asp Ser Asp Leu
65 70 75 80

Ile Asp Asp Ser Ser Ala Ser Pro Ile Pro Pro Ser Tyr Leu Glu Tyr
85 90 95

Lys Ala Ala Leu Tyr Pro Phe Ser Glu His Lys Ser Val Arg Asn Ala
100 105 110

Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp His Phe Leu Asp Asn
115 120 125

Gln Asp Ser Gln
130 



<210> 231 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 231 

Gly Trp Thr Arg Glu Asn Asp His Arg Ala Leu Ser Lys Ala Gly Ile
1 5 10 15

Gly Ser Ala Glu Ile Gln Pro Ser Asn Leu Arg Val Gly Ser Ala Lys
20 25 30

Asp Leu Gly Lys Pro Trp Ala Gly Lys Leu Leu Leu Leu
35 40 45



<210> 232 

<211> 46 

<212> PRT 

<213> Homo sapiens 




<400> 232 

Ser Ser Cys Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gln
1 5 10 15

Met Leu Ala Pro Pro Leu Gln Glu Asp Trp Lys Gly Gly Val Lys Asp
20 25 30

Ser Asp Leu Ile Asp Asp Ser Ser Ala Ser Pro Ile Pro Pro
35 40 45



<210> 233 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 233 

Ser Tyr Leu Glu Tyr Lys Ala Ala Leu Tyr Pro Phe Ser Glu His Lys 
1 5 10 15

Ser Val Arg Asn Ala Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp 
20 25 30 

His Phe Leu Asp Asn Gln Asp Ser Gln
35 40 



<210> 234 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 234 

Leu Lys Phe His Gln Glu Ser Leu Ser Gly Asp
1 5 10 



<210> 235 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 235 

Glu Ala Lys Ser Arg Pro Val Thr Gln Ala Gly Val Gln Trp His Asp
1 5 10 15

Leu Gly Ser Leu Gln Pro Leu Pro Pro
20 25 



<210> 236 

<211> 25 

<212> PRT 

<213> Homo sapiens

<400> 236

Glu Ala Lys Ser Arg Pro Val Thr Gln Ala Gly Val Gln Trp His Asp
1 5 10 15

Leu Gly Ser Leu Gln Pro Leu Pro Pro
20 25 



<210> 237 
<211> 137 
<212> PRT 
<213> Homo sapiens 

<400> 237 

Ala Leu Val Leu Val Cys Arg Gln Arg Tyr Cys Arg Pro Arg Asp Leu
1 5 10 15

Leu Gln Arg Tyr Asp Ser Lys Pro Ile Val Asp Leu Ile Gly Ala Met
20 25 30

Glu Thr Gln Ser Glu Pro Ser Glu Leu Glu Leu Asp Asp Val Val Ile
35 40 45

Thr Asn Pro His Ile Glu Ala Ile Leu Glu Asn Glu Asp Trp Ile Glu
50 55 60

Asp Ala Ser Gly Leu Met Ser His Cys Ile Ala Ile Leu Lys Ile Cys
65 70 75 80

His Thr Leu Thr Glu Lys Leu Val Ala Met Thr Met Gly Ser Gly Ala
85 90 95

Lys Met Lys Thr Ser Ala Ser Val Ser Asp Ile Ile Val Val Ala Lys
100 105 110

Arg Ile Ser Pro Arg Val Asp Asp Val Val Lys Ser Met Tyr Pro Pro
115 120 125 

Leu Asp Pro Lys Leu Leu Asp Ala Arg 
130 135 



<210> 238 
<211> 319 
<212> PRT 
<213> Homo sapiens 

<400> 238 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro 
1 5 10 15

Gly Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly
20 25 30

Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu Leu Leu Leu Pro
35 40 45

Leu Leu Ser Gln Pro Thr Leu Gly Ser Leu Trp Asp Ser Leu Arg His
50 55 60

Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro Ala Leu Glu Ala
65 70 75 80

Pro Pro Ser Leu Val Ser Leu Gly Cys Ser Gly Gly Cys Pro Arg Leu
85 90 95

Ser Leu Ala Gly Ser Ala Ser Pro Phe Pro Phe Leu Thr Ala Leu Leu
100 105 110

Ser Leu Leu Asn Thr Leu Ala Gln Ile His Lys Gly Leu Cys Gly Gln
115 120 125

Leu Ala Ala Ile Leu Ala Ala Pro Gly Leu Gln Asn Tyr Phe Leu Gln
130 135 140

Cys Val Ala Pro Gly Ala Ala Pro His Leu Thr Pro Phe Ser Ala Trp
145 150 155 160

Ala Leu Arg His Glu Tyr His Leu Gln Tyr Leu Ala Leu Ala Leu Ala
165 170 175

Gln Lys Ala Ala Ala Leu Gln Pro Leu Pro Ala Thr His Ala Ala Leu
180 185 190

Tyr His Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser
195 200 205

Glu Tyr Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg Leu Glu
210 215 220

Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp Phe Ser
225 230 235 240

Asp Gln Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly Gln Gly
245 250 255

Thr Leu Leu Ala Gln Ala Cys Gln Asp Leu Pro Ser Ile Arg Asn Cys
260 265 270

Tyr Leu Thr His Cys Ser Pro Ala Arg Ala Ser Leu Leu Ala Ser Gln
275 280 285

Ala Leu His Arg Gly Glu Leu Gln Arg Val Pro Thr Leu Leu Leu Pro
290 295 300

Met Pro Thr Glu Pro Leu Leu Pro Thr Asp Trp Pro Phe Leu His
305 310 315

<210> 239

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 239 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro
1 5 10 15

Gly Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly
20 25 30

Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu
35 40



<210> 240 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 240 

Leu Leu Leu Pro Leu Leu Ser Gln Pro Thr Leu Gly Ser Leu Trp Asp
1 5 10 15

Ser Leu Arg His Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro 
20 25 30 

Ala Leu Glu Ala Pro Pro Ser Leu Val Ser Leu Gly Cys 
35 40 45 



<210> 241 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Ser Gly Gly Cys Pro Arg Leu Ser Leu Ala Gly Ser Ala Ser Pro Phe 
1 5 10 15

Pro Phe Leu Thr Ala Leu Leu Ser Leu Leu Asn Thr Leu Ala Gln Ile
20 25 30

His Lys Gly Leu Cys Gly Gln Leu Ala Ala Ile Leu Ala
35 40 45 



<210> 242 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 242 

Ala Pro Gly Leu Gln Asn Tyr Phe Leu Gln Cys Val Ala Pro Gly Ala
1 5 10 15

Ala Pro His Leu Thr Pro Phe Ser Ala Trp Ala Leu Arg His Glu Tyr
20 25 30

His Leu Gln Tyr Leu Ala Leu Ala Leu Ala Gln Lys
35 40 



<210> 243 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 243 

Ala Ala Ala Leu Gln Pro Leu Pro Ala Thr His Ala Ala Leu Tyr His
1 5 10 15

Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser Glu Tyr 
20 25 30 

Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg 
35 40 



<210> 244 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Leu Glu Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp 
1 5 10 15

Phe Ser Asp Gln Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly
20 25 30

Gln Gly Thr Leu Leu Ala Gln Ala Cys Gln Asp Leu
35 40 



<210> 245 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<400> 245 

Pro Ser Ile Arg Asn Cys Tyr Leu Thr His Cys Ser Pro Ala Arg Ala
1 5 10 15

Ser Leu Leu Ala Ser Gln Ala Leu His Arg Gly Glu Leu Gln Arg Val
20 25 30

Pro Thr Leu Leu Leu Pro Met Pro Thr Glu Pro Leu Leu Pro Thr Asp
35 40 45

Trp Pro Phe Leu His
50 



<210> 246 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 246 

Val Gly Ser Val Leu Gly Ala Phe Leu Thr Phe Pro Gly Leu Arg Leu 
1 5 10 15

Ala Gln Thr His Arg Asp Ala Leu Thr
20 25 



<210> 247 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (21) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (37) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 247 

Leu Glu Cys Thr Asp Thr Ile Met Val His Cys Ser Leu Lys Leu Leu
1 5 10 15

Ser Pro Ser Asp Xaa Ser His Ser Ala Ser Gln Val Ala Lys Thr Arg
20 25 30

Gly Val His His Xaa Thr Gln Leu Ile Phe Lys Val Phe Phe Val Xaa
35 40 45

Met Gly Ser His Ser Thr Lys Tyr Xaa Ser Ile Arg Pro Gly Leu Leu
50 55 60

Pro
65 



<210> 248 

<211> 14 

<212> PRT 

<213> Homo sapiens 



<400> 248 

Glu Ser Ser Phe Val Pro Pro Ala Ala His Ser Ser Leu Cys 
1 5 10



<210> 249 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 249 

Leu Leu Pro Gly Gln Gln Glu Ala Thr Gln Cys Val Glu Ala Gly Ala
1 5 10 15

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gln Glu Gln Phe
20 25 30

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn Ser Thr Val
35 40 45

Tyr Ile Met Ala Met Ala Ile Gln Met Ala Pro Phe Ala Ile Asn Tyr
50 55 60

Lys Val Arg Pro Gly Pro Cys Xaa Asn Ile His Cys Leu Pro Thr Gln
65 70 75 80 

Pro His Pro Met Lys Pro Ser Val Pro His Pro His Arg Ala Arg Pro 
85 90 95 

Ser Trp Arg Ala Cys Pro Arg Thr Ser Pro Trp Cys Gly Val Trp Gln
100 105 110 

Phe His Ser Trp Pro Ser Leu Ala Cys Ser Ser Ala Pro Arg Pro Thr 
115 120 125 

Ser Thr Ala Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp 
130 135 140 

Ser Leu Pro Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro
145 150 155 160

Thr Ala Ser Cys Ser Ser Ser Trp Gly Pro Arg Ser
165 170 



<210> 250 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 250 

Leu Leu Pro Gly Gln Gln Glu Ala Thr Gln Cys Val Glu Ala Gly Ala
1 5 10 15

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gln Glu Gln Phe
20 25 30 

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn 
35 40 45 



<210> 251 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 251 

Ser Thr Val Tyr Ile Met Ala Met Ala Ile Gln Met Ala Pro Phe Ala
1 5 10 15

Ile Asn Tyr Lys Val Arg Pro Gly Pro Cys Xaa Asn Ile His Cys Leu
20 25 30

Pro Thr Gln Pro His Pro Met Lys Pro Ser Val Pro
35 40 



<210> 252 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 252 

His Pro His Arg Ala Arg Pro Ser Trp Arg Ala Cys Pro Arg Thr Ser 
1 5 10 15

Pro Trp Cys Gly Val Trp Gln Phe His Ser Trp Pro Ser Leu Ala Cys
20 25 30

Ser Ser Ala Pro Arg Pro Thr Ser Thr Ala
35 40



<210> 253 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 253 

Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp Ser Leu Pro 
1 5 10 15

Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro Thr Ala Ser 
20 25 30 

Cys Ser Ser Ser Trp Gly Pro Arg Ser 
35 40 



<210> 254 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 254 

Thr Arg Asn Ile Leu Ser Phe Ile Lys Cys Val Ile His Asn Phe Trp
1 5 10 15

Ile Pro Lys Glu Ser Asn Glu Ile Thr Ile Ile Ile Asn Pro Tyr Arg
20 25 30

Glu Thr Val Cys Phe Ser Val Glu Pro Val Lys Lys Ile Phe Asn Tyr
35 40 45 



<210> 255 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 255 

Leu Val Val Leu Phe Ala Ser Ser Asn Ser Arg Tyr Leu Lys Tyr Phe 
1 5 10 15 

Phe Leu Val Pro Leu Ile Leu Gly Ser Ala Trp
20 25 



<210> 256 
<211> 20 
<212> PRT 

<213> Homo sapiens

<400> 256

His Glu Trp Lys Cys Lys Gln Lys Tyr Ser Glu Gly Ser Gly Asn Thr
1 5 10 15

Arg Ile Gly Asn
20 



<210> 257 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 257 

Leu Leu Pro Leu Cys Phe Leu Gly Pro Arg Gln Val Leu Glu Glu Phe
1 5 10 15

Pro Ser Ile Val
20 



<210> 258 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 258 

Pro Thr Arg Pro Ser Lys His Gln Glu Ala Gly Ser
1 5 10



<210> 259 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (39) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 259 

Gly Gln Gly Pro Ala Gly Arg Trp Val Arg Arg Leu Pro Cys Ser Arg
1 5 10 15

Arg Ala Gly Gly Glu Arg Gly Pro His Trp Gly Val Trp Ala Gly Pro
20 25 30

Gln Met Ser Cys Gly Leu Xaa Phe Gly Pro
35 40 



<210> 260 
<211> 193
<212> PRT

<213> Homo sapiens 
<400> 260 

Trp Arg Thr Gln Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro
1 5 10 15

Ala Thr Met Leu Thr Glu Pro Gln Asn Pro His Leu Ile Gly Phe Val
20 25 30

Ala Tyr Ser Gly Pro Ser His Thr Thr Gln Pro His Lys Tyr Trp Leu
35 40 45

Leu Leu Asp Gly Gln Ala Asp Pro Ala Ala Ala Glu Gly Pro Val Lys
50 55 60

Arg Lys Ala Ala Ser Val Val Trp Trp Pro Gln Ala Leu Arg His Leu
65 70 75 80

Ser Leu Leu Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn Ile Gly
85 90 95

Cys Gln Ser Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp
100 105 110

Asp Leu Gly Val Ala Phe Arg Arg Asp Thr Cys Met Ser Ser Ser Ser 
115 120 125 

Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser Leu His Tyr Phe 
130 135 140 

Ala Leu Ser Phe Val Leu Ile Leu Thr Glu Ile Cys Leu Val Ser Ser
145 150 155 160

Gly Met Gly Phe Pro Gln Glu Gly Lys His Phe Ser Val Leu Gly Ser
165 170 175 

Pro Asp Cys Ser Leu Trp Gly Arg Asp Glu His Val Pro Arg Glu Phe 
180 185 190 

Ala 



<210> 261 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<400> 261 

Trp Arg Thr Gln Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro
1 5 10 15

Ala Thr Met Leu Thr Glu Pro Gln Asn Pro His Leu Ile Gly Phe Val
20 25 30

Ala Tyr Ser Gly Pro Ser His Thr Thr Gln
35 40 



<210> 262 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 262 

Pro His Lys Tyr Trp Leu Leu Leu Asp Gly Gln Ala Asp Pro Ala Ala
1 5 10 15

Ala Glu Gly Pro Val Lys Arg Lys Ala Ala Ser Val Val Trp Trp Pro
20 25 30

Gln Ala Leu Arg His Leu Ser Leu Leu
35 40 



<210> 263 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 263 

Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn Ile Gly Cys Gln Ser
1 5 10 15

Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp Asp Leu Gly 
20 25 30 

Val Ala Phe Arg Arg Asp Thr Cys Met 
35 40 



<210> 264 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 264 

Ser Ser Ser Ser Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser 
1 5 10 15

Leu His Tyr Phe Ala Leu Ser Phe Val Leu Ile Leu Thr Glu Ile Cys
20 25 30

Leu Val Ser Ser Gly Met Gly Phe Pro Gln Glu Gly
35 40 



<210> 265 
<211> 25
<212> PRT

<213> Homo sapiens

<400> 265

Lys His Phe Ser Val Leu Gly Ser Pro Asp Cys Ser Leu Trp Gly Arg
1 5 10 15

Asp Glu His Val Pro Arg Glu Phe Ala
20 25



<210> 266 

<211> 31 

<212> PRT 

<213> Homo sapiens 



<400> 266 

Ile Ala Gln Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gln Ser Ser
1 5 10 15

Gly Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly
20 25 30



<210> 267 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 267 

Ile Ala Gln Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gln Ser Ser
1 5 10 15

Gly Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly
20 25 30



<210> 268 

<211> 28 

<212> PRT 

<213> Homo sapiens 



<400> 268 

Asp Cys Leu Tyr Leu Ala Leu Ser Phe Pro Trp His Cys His Cys His
1 5 10 15

His His Pro Pro Ser Gly Ser Leu Leu Tyr Pro Phe
20 25



<210> 269 
<211> 101 
<212> PRT 

<213> Homo sapiens

<400> 269

Ala Ser Leu Pro Pro Ser Arg Ser Arg Pro Leu Ala Asn Met Ala Leu
1 5 10 15

Val Pro Cys Gln Val Leu Arg Met Ala Ile Leu Leu Ser Tyr Cys Ser
20 25 30

Ile Leu Cys Asn Tyr Lys Ala Ile Glu Met Pro Ser His Gln Thr Tyr
35 40 45

Gly Gly Ser Trp Lys Phe Leu Thr Phe Ile Asp Leu Val Ile Gln Ala
50 55 60

Val Phe Phe Gly Ile Cys Val Leu Thr Asp Leu Ser Ser Leu Leu Thr
65 70 75 80

Arg Gly Ser Gly Asn Gln Glu Gln Glu Arg Gln Leu Lys Lys Leu Ile
85 90 95

Ser Leu Arg Asp Trp
100



<210> 270 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 270 

Met Ser Arg Ser Ser Arg Ile Ser Gly Leu Ser Cys Pro Trp Leu Leu
1 5 10 15



<210> 271 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 271 

Asp His Trp Pro Ala Gly Phe Leu Pro Pro Ala Pro Gly Leu Lys Phe
1 5 10 15

Pro Val Ala Leu Glu Val Phe Arg Lys Val Leu Pro Ala Val Cys Pro
20 25 30

Thr Asp Cys Ser Gly Ser Ala Gly Lys Glu Arg Asn Ser
35 40 45



<210> 272 
<211> 47 
<212> PRT

<213> Homo sapiens
<400> 272

Glu Glu Ile Ala Thr Ser Ile Glu Pro Ile Arg Asp Phe Leu Ala Ile
1 5 10 15

Val Phe Phe Ala Ser Ile Gly Leu His Val Phe Pro Thr Phe Val Ala
20 25 30

Tyr Glu Leu Thr Val Leu Val Phe Leu Thr Leu Ser Val Val Val
35 40 45



<210> 273 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 273 

Tyr Cys Asn Leu Gln Cys Arg
1 5 



<210> 274 
<211> 44 
<212> PRT 

<213> Homo sapiens 



<400> 274 

Ser Ala Leu Ile Gly Asn Pro Lys Gly Cys Phe Gly Cys Phe Ser Pro
1 5 10 15

Val Val Leu Arg Glu Trp Ser Val Glu Ser Trp Lys Ser Leu Arg Pro
20 25 30

Phe Gln Ala Ile Cys Lys Leu Lys Thr Asn Phe Arg
35 40



<210> 275 
<211> 8 
<212> PRT 

<213> Homo sapiens 
<400> 275 

His Glu Ala Ala Leu Arg Gly Pro 
1 5 



<210> 276 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 276

Ser Asn Ala Ala Gly Asn Val Val Arg Ala Phe Leu Tyr Ile Asn His
1 5 10 15

Leu Lys Leu Gly Cys Lys Val Gly Leu Ala 
20 25 



<210> 277 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 277 

Asn Trp Ala Val Leu Asn Met Leu Leu Ser Lys Gly Lys Ile Thr Ile
1 5 10 15

Phe Leu Gly Pro Leu Glu Cys Gly Ser 
20 25 



<210> 278 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<400> 278 

Pro Ser His Gln Thr Arg Lys Gly Lys Ser Ala Lys Leu Leu Asp Arg
1 5 10 15

Pro Pro Glu Ala Leu Arg Met Lys Ile Ile Thr Thr Thr Leu Leu Leu
20 25 30

Ala Cys His Leu Gln Leu Glu Val Gly Val Val Val Gly Gly Glu Val
35 40 45 

Asp 



<210> 279 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 279 

Phe Gln Ala Ser Ser Ala Asn Asn Gln Gln Asn Trp Gly Ser Gln Pro
1 5 10 15

Ile Ala Gln Gln Pro Leu Gln Gln Gly Gly Asp Tyr Ser Gly Asn Tyr
20 25 30

Gly Tyr Asn Asn Asp Asn Gln Glu Phe Tyr Gln Asp Thr Tyr Gly Gln
35 40 45

Gln Trp Lys

<210> 280
<211> 264 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2) 

<223> Xaa equals any of the naturally occurring L-amino acids
<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (14) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 280 

Trp Xaa Pro Leu Leu Xaa Thr Ser Gly Ser Pro Gly Leu Xaa Gly Phe 
1 5 10 15 

Gly Thr Arg Met Asn Gly Lys Glu Ile Glu Gly Glu Glu Ile Glu Ile
20 25 30

Val Leu Ala Lys Pro Pro Asp Lys Lys Arg Lys Glu Arg Gln Ala Ala
35 40 45

Arg Gln Ala Ser Arg Ser Thr Ala Tyr Glu Asp Tyr Tyr Tyr His Pro
50 55 60

Pro Pro Arg Met Pro Pro Pro Ile Arg Gly Arg Gly Arg Gly Gly Gly
65 70 75 80

Arg Gly Gly Tyr Gly Tyr Pro Pro Asp Tyr Tyr Gly Tyr Glu Asp Tyr
85 90 95

Tyr Asp Asp Tyr Tyr Gly Tyr Asp Tyr His Asp Tyr Arg Gly Gly Tyr
100 105 110

Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly Tyr Ala Val Arg Gly Arg
115 120 125

Gly Gly Gly Arg Gly Gly Arg Gly Ala Pro Pro Pro Pro Arg Gly Arg
130 135 140

Gly Ala Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gln Arg Gly Ala
145 150 155 160

Pro Leu Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly Pro Ala
165 170 175

Gln Gln Gln Arg Gly Arg Gly Ser Arg Gly Ser Arg Gly Asn Arg Gly
180 185 190

Gly Asn Val Gly Gly Lys Arg Lys Ala Asp Gly Tyr Asn Gln Pro Asp
195 200 205

Ser Lys Arg Arg Gln Pro Thr Thr Asn Arg Thr Gly Val Pro Asn Pro
210 215 220

Ser Leu Ser Ser Arg Phe Ser Lys Val Val Thr Ile Leu Val Thr Met
225 230 235 240

Val Thr Ile Met Thr Thr Arg Asn Phe Ile Arg Ile Leu Met Gly Asn
245 250 255

Ser Gly Ser Arg Gln Val Arg Ala
260



<210> 281 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 281 

Arg Met Asn Gly Lys Glu Ile Glu Gly Glu Glu Ile Glu Ile Val Leu
1 5 10 15 

Ala Lys Pro Pro Asp Lys Lys Arg Lys Glu Arg 
20 25 



<210> 282 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 282 

Tyr Tyr His Pro Pro Pro Arg Met Pro Pro Pro Ile Arg Gly Arg Gly
1 5 10 15 

Arg Gly Gly Gly Arg Gly Gly Tyr Gly 
20 25 



<210> 283 
<211> 26 
<212> PRT 

<213> Homo sapiens 



<400> 283 

Asp Tyr Arg Gly Gly Tyr Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly 
1 5 10 15

Tyr Ala Val Arg Gly Arg Gly Gly Gly Arg
20 25 



<210> 284 
<211> 28 
<212> PRT 

<213> Homo sapiens 



<400> 284 

Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gln Arg Gly Ala Pro Leu
1 5 10 15

Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly
20 25 



<210> 285 
<211> 35 
<212> PRT 

<213> Homo sapiens
<400> 285

Ala Asp Gly Tyr Asn Gln Pro Asp Ser Lys Arg Arg Gln Pro Thr Thr
1 5 10 15

Asn Arg Thr Gly Val Pro Asn Pro Ser Leu Ser Ser Arg Phe Ser Lys
20 25 30

Val Val Thr
35



<210> 286 
<211> 19 
<212> PRT 

<213> Homo sapiens 



<400> 286 

Leu Gln Ile Pro Pro Ser Ser Gln Ser Leu Gly Leu Lys Asn Ala Asp
1 5 10 15

Ser Ser Ile



<210> 287 
<211> 129 
<212> PRT 

<213> Homo sapiens 



<400> 287 

Gly Gly Pro Pro Glu Ser Ala Pro Trp Leu Pro Ala Val Leu Arg Ala
1 5 10 15

Pro Val Leu Thr Ser Arg Cys Ala Ser Ser Asp Ser Glu Gly Pro Val
20 25 30

Trp Phe Cys Gln Pro Gly Ser Gly Pro Ser Ser Thr Glu Met Ser Cys
35 40 45

His Cys Ile Leu Gly Pro Gly Ser Ser Cys Leu Cys Val Leu Arg Gly
50 55 60

Ser Met Trp Thr Pro Ser Val Pro Gly Trp Pro Gln Pro Ala Lys Glu
65 70 75 80

Thr Gly Ala Ser Ser Cys Ser Val Phe Ser Ala Asn Asn Gly Ser Cys
85 90 95

Pro Leu Pro Leu His Asn His Gln Arg Gln Ala Ser Leu Asp Thr Gly
100 105 110 

Leu Ser Leu Glu His Val Pro Gly Glu Ser Tyr Phe Tyr Ser Pro Val 
115 120 125 

Gly 



<210> 288 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 288 

Ser Ser Asp Ser Glu Gly Pro Val Trp Phe Cys Gln Pro Gly Ser Gly
1 5 10 15

Pro Ser Ser Thr Glu Met Ser Cys His Cys Ile Leu Gly Pro Gly Ser
20 25 30 

Ser Cys 



<210> 289 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 289 

Trp Thr Pro Ser Val Pro Gly Trp Pro Gln Pro Ala Lys Glu Thr Gly
1 5 10 15

Ala Ser Ser Cys Ser Val Phe Ser Ala Asn Asn Gly
20 25

<210> 290

<211> 21 

<212> PRT 

<213> Homo sapiens 



<400> 290 

Gln Arg Gln Ala Ser Leu Asp Thr Gly Leu Ser Leu Glu His Val Pro
1 5 10 15

Gly Glu Ser Tyr Phe 
20 



<210> 291 
<211> 29 
<212> PRT 

<213> Homo sapiens 



<400> 291 

Ser Ser Ser Leu Val Leu Thr Ile Arg Ser Gln Thr Leu Phe Leu Ala
1 5 10 15

Ser Phe Ile His Ser Thr Ser Ile Phe Cys Ala Leu Asn
20 25



<210> 292 

<211> 12 

<212> PRT 

<213> Homo sapiens 



<400> 292 

Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys 
1 5 10 



<210> 293 

<211> 22 

<212> PRT 

<213> Homo sapiens 



<400> 293 

Arg Ala Phe Trp Gly Leu Gly Ala Leu Gln Leu Leu Asp Leu Ser Ala
1 5 10 15

Asn Gln Leu Glu Ala Leu
20



<210> 294 

<211> 34 

<212> PRT 

<213> Homo sapiens 



<400> 294

His Ala Ser Gly Arg Arg Thr Gly Ser Ala Asp Asp Gly Leu Gln Gly
1 5 10 15 

Arg Thr Gly Ser Gly Pro Pro Thr Ala Gly Ala Gly Gly Gly Gly Ala 
20 25 30 

Ala Pro 



<210> 295 
<211> 205 
<212> PRT 
<213> Homo sapiens 

<400> 295 

Val Ser Ala Ala Ala Gly Ala Arg Leu Ala Pro Arg Ala Pro Gly Ala 
1 5 10 15

Pro Ala Gly Cys Arg Pro Met Arg Gly Cys Ala Ala Arg Ala Ala Ala 
20 25 30 

Arg Lys Ser Leu Val Pro Val Leu Pro Ala Gly Trp Arg Ser Gly Pro 
35 40 45 

Ala Ala Ala Ala Arg Pro Gly Pro Arg Arg Leu Ala His Ala Pro Ser 
50 55 60 

Ala Ala Arg Ser Arg Ala Gly Pro Gly Ala Val Ala Arg Pro Leu Pro 
65 70 75 80 

Arg Arg His Leu Ala Ala Ala His Gly Arg Gly Cys Gly Pro Ala Ala 
85 90 95 

Ala Arg Ala Gly Ala Gly Ser Gly Pro Gly Ala Arg Arg Ala Ala Arg 
100 105 110 

Val Pro Thr Ala Gly Arg Pro Pro Gly Thr His Val His Thr Ser Gly 
115 120 125 

Gln Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala Asp Thr
130 135 140

Trp Ala Gln Thr Gly Gln Gly Asp Ser Ser Ser Asn Ser Ser Ser Ser
145 150 155 160

Gly Arg Gly Arg Asp Gln Glu Gly Pro Arg Met Gly Ala Ala Pro Pro
165 170 175 

Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser 
180 185 190 



Pro Ser Ser Ala Glu Pro Val Leu Arg Pro Asp Ala Trp 
195 200 205

<210> 296
<211> 368 
<212> PRT 
<213> Homo sapiens 

<400> 296 

Thr Arg Pro Ala Ala Glu Arg Ala Pro Arg Thr Thr Gly Ser Arg Asp
1 5 10 15

Ala Gln Ala Ala Gly Leu Pro Pro Arg Val Pro Gly Ala Gly Gly Leu
20 25 30

Pro Pro Cys Gly Ala Leu Pro Gly Arg Gly Leu Gly Arg Cys Cys Cys
35 40 45

Pro Cys Cys Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys Arg
50 55 60

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys
65 70 75 80

Ala Arg Ala Cys Arg Glu Ala Cys Gln Leu Ser Ala Leu Ser Leu Pro
85 90 95

Ala Val Pro Pro Gly Leu Ser Leu Arg Leu Arg Ala Leu Leu Leu Asp
100 105 110

His Asn Arg Val Arg Ala Leu Pro Pro Gly Ala Phe Ala Gly Ala Gly
115 120 125

Ala Leu Gln Arg Leu Asp Leu Arg Glu Asn Gly Leu His Ser Val His
130 135 140

Val Arg Ala Phe Trp Gly Leu Gly Ala Leu Gln Leu Leu Asp Leu Ser
145 150 155 160

Ala Asn Gln Leu Glu Ala Leu Ala Pro Gly Thr Phe Ala Pro Leu Arg
165 170 175

Ala Leu Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu
180 185 190

Pro Ala Ala Leu Gly Ala Leu Pro Leu Leu Arg Ser Leu Ser Leu Gln
195 200 205

Asp Asn Glu Leu Ala Ala Leu Ala Pro Gly Leu Leu Gly Arg Leu Pro
210 215 220

Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys Gly Cys
225 230 235 240

Ala Leu Arg Pro Leu Cys Ala Trp Leu Arg Arg His Pro Leu Pro Ala
245 250 255

Ser Glu Ala Glu Thr Val Leu Cys Val Trp Pro Gly Arg Leu Thr Leu
260 265 270

Ser Pro Leu Thr Ala Phe Ser Asp Ala Ala Phe Ser His Cys Ala Gln
275 280 285

Pro Leu Ala Leu Arg Asp Leu Ala Arg Gly Leu His Ala Arg Ala Gly
290 295 300

Leu Leu Pro Arg Gln Pro Gly Phe Leu Pro Gly Ala Gly Leu Trp Ala
305 310 315 320

His Arg Leu Pro Cys Ala Pro Pro Pro Pro Pro His Arg Arg Pro Pro
325 330 335

Pro Ala Glu Thr Val Gln Thr Arg Thr Pro Ile Pro Thr Pro Thr Ala
340 345 350

Val Pro Arg Pro Arg Thr Arg Gly Ala Pro Ser Ala Ala Ala Gln Ala
355 360 365



<210> 297 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<400> 297 

Gly Cys Arg Pro Met Arg Gly Cys Ala Ala Arg Ala Ala Ala Arg Lys
1 5 10 15

Ser Leu Val Pro Val Leu Pro Ala Gly Trp Arg Ser Gly Pro Ala Ala
20 25 30

Ala Ala Arg Pro Gly Pro Arg Arg Leu Ala His Ala Pro Ser Ala
35 40 45



<210> 298 
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 298 

Pro Gly Ala Val Ala Arg Pro Leu Pro Arg Arg His Leu Ala Ala Ala
1 5 10 15

His Gly Arg Gly Cys Gly Pro Ala Ala Ala Arg Ala Gly Ala
20 25 30



<210> 299
<211> 24
<212> PRT

<213> Homo sapiens
<400> 299

Ser Gly Gln Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala
1 5 10 15

Asp Thr Trp Ala Gln Thr Gly Gln
20



<210> 300 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 300 

Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser
1 5 10 15

Pro Ser Ser Ala Glu Pro Val
20



<210> 301 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 301 

Ala Pro Arg Thr Thr Gly Ser Arg Asp Ala Gln Ala Ala Gly Leu Pro
1 5 10 15

Pro Arg Val Pro Gly Ala Gly Gly Leu Pro
20 25



<210> 302 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 302 

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys
1 5 10 15

Ala Arg Ala Cys Arg Glu
20



<210> 303 
<211> 31 
<212> PRT 

<213> Homo sapiens
<400> 303

Ala Val Pro Pro Gly Leu Ser Leu Arg Leu Arg Ala Leu Leu Leu Asp
1 5 10 15

His Asn Arg Val Arg Ala Leu Pro Pro Gly Ala Phe Ala Gly Ala 
20 25 30 



<210> 304 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 304 

Leu Gly Ala Leu Gln Leu Leu Asp Leu Ser Ala Asn Gln Leu Glu Ala
1 5 10 15 

Leu Ala Pro Gly Thr Phe Ala Pro 
20 



<210> 305 

<211> 36 

<212> PRT 

<213> Homo sapiens 

<400> 305 

Pro Pro Gly Ala Phe Ala Gly Ala Gly Ala Leu Gln Arg Leu Asp Leu
1 5 10 15

Arg Glu Asn Gly Leu His Ser Val His Val Arg Ala Phe Trp Gly Leu
20 25 30

Gly Ala Leu Gln
35 



<210> 306 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 306 

Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu Pro Ala 
1 5 10 15

Ala Leu Gly Ala Leu Pro Leu Leu Arg Ser Leu Ser 
20 25 



<210> 307 
<211> 26 
<212> PRT 

<213> Homo sapiens
<400> 307

Leu Pro Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys 
1 5 10 15 

Gly Cys Ala Leu Arg Pro Leu Cys Ala Trp 
20 25 



<210> 308 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 308 

Thr Val Leu Cys Val Trp Pro Gly Arg Leu Thr Leu Ser Pro Leu Thr
1 5 10 15

Ala Phe Ser Asp Ala Ala Phe Ser His Cys Ala Gln Pro Leu Ala Leu
20 25 30

Arg Asp



<210> 309 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 309 

Leu His Ala Arg Ala Gly Leu Leu Pro Arg Gln Pro Gly Phe Leu Pro
1 5 10 15

Gly Ala Gly Leu Trp Ala His Arg
20



<210> 310 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 310 

Thr Val Gln Thr Arg Thr Pro Ile Pro Thr Pro Thr Ala Val Pro Arg
1 5 10 15

Pro Arg Thr Arg Gly Ala Pro Ser
20



<210> 311 
<211> 59 
<212> PRT 

<213> Homo sapiens
<400> 311

His Ala Ser Gly Arg Pro Asp Arg Ser Ser Ala Pro Ile Gly Asn Ser
1 5 10 15

Gly Leu Pro Cys Pro Asp Leu Glu Pro Leu Gly Gly Leu Gln Ser Lys
20 25 30 

Cys Arg Leu Cys Ala Pro Thr Glu Ala Arg Gly Leu Trp Ser Arg Ser 
35 40 45 

Leu Cys Ser Asp Arg Cys Asp Thr Trp Arg Ser 
50 55 



<210> 312 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 312 

Gly Leu Pro Cys Pro Asp Leu Glu Pro Leu Gly Gly Leu Gln Ser Lys
1 5 10 15

Cys Arg Leu Cys Ala Pro Thr Glu Ala Arg Gly Leu Trp 
20 25 



<210> 313 
<211> 16 
<212> PRT 

<213> Homo sapiens 



<400> 313 

Gln Glu Trp Glu Ser Glu Leu Gly Glu Arg Arg Lys Pro Leu Gln Ala
1 5 10 15



<210> 314 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<400> 314 

Cys Gln Ser Ser Asn Leu Ile Phe Phe Gln Phe Val Asn Ile Leu Phe
1 5 10 15

Asn Leu Met Met Asp Ile Leu Val Asp Phe Ser Ile Thr Lys Met Pro
20 25 30

Ile Asn Ser Ile Phe Ser Leu Tyr Phe Cys Tyr Glu Ile Ile
35 40 45

<210> 315
<211> 134 
<212> PRT 
<213> Homo sapiens 



<400> 315 

Gly Pro Val Trp Leu Phe Cys Phe Leu Thr Leu Cys Arg Lys Pro Ser
1 5 10 15

Gln Leu Phe Ser Gln Glu Asn Ser Cys Met Asp Val Ala Gly Gly Val
20 25 30

Thr Thr Cys Leu Pro Pro Trp Phe Ser Arg Gly Ala Pro Ala Gln Met
35 40 45

Ser Gln Trp Pro Pro Ser Ser Asp His Gly Ala Val Arg Ala Gly Arg
50 55 60

Asp Ser Arg Val Gly Pro Val Gln Pro Ser His Leu Thr Cys Glu Gly
65 70 75 80

Gly Lys Glu Glu Arg Glu Lys Asn Lys Lys Ala Glu Val Asn Pro Pro
85 90 95

Thr Gly Met Gly Leu Ala Asn Arg Ile Pro Arg Asp Asp Ile Thr Leu
100 105 110

Lys Leu Arg Asn Gln Gly Lys Leu Arg Thr Lys Glu Asn Arg Thr Gln
115 120 125

Ser Ala Lys Arg His Pro
130



<210> 316 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 316 

Val Ala Cys Lys Pro Glu Asn Arg Thr Lys Thr His Phe Ala Ser Ser
1 5 10 15

Pro Ala Cys Asp Gly His Ala Leu Gly Gly Gln Val Gly Phe Ala Ile
20 25 30 

Cys Phe Leu Ser Cys Leu Phe Pro Pro Met 
35 40 



<210> 317 
<211> 40 
<212> PRT 
<213> Homo sapiens
<400> 317

Ser His Pro Met Pro Asn Thr Pro Gln Lys Gln Leu Leu Phe Ser Glu
1 5 10 15

Asp Asn Glu Leu Leu Val Ser Leu Arg Thr Gly Arg Lys Pro Thr Leu
20 25 30

Gln Ala Ala Leu Arg Val Thr Gly
35 40 



<210> 318 

<211> 59 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 318 

Glu Gly Asp Pro Arg Gly Arg Pro Arg Pro Arg Pro Leu Gly Pro Pro 
1 5 10 15 

Pro Gln Leu Thr Leu Pro Thr Ala Leu Xaa Asp Ile Leu Arg Gln Val
20 25 30

Arg Ala Pro Gly Leu Arg Leu Ser Arg Ala Leu Glu Val Gly Arg Lys
35 40 45

Gly Ser Pro Ile Phe Lys Ile Gln Ile Tyr Leu
50 55 



<210> 319 
<211> 250 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (145) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<400> 319 

Ala His Arg Leu Gln Ile Arg Leu Leu Thr Trp Asp Val Lys Asp Thr
1 5 10 15

Leu Leu Arg Leu Arg His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala
20 25 30

Arg Ala His Gly Leu Glu Val Glu Pro Ser Ala Leu Glu Gln Gly Phe
35 40 45

Arg Gln Ala Tyr Arg Ala Gln Ser His Ser Phe Pro Asn Tyr Gly Leu
50 55 60

Ser His Gly Leu Thr Ser Arg Gln Trp Trp Leu Asp Val Val Leu Gln
65 70 75 80

Thr Phe His Leu Ala Gly Val Gln Asp Ala Gln Ala Val Ala Pro Ile
85 90 95

Ala Glu Gln Leu Tyr Lys Asp Phe Ser His Pro Cys Thr Trp Gln Val
100 105 110

Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly Leu
115 120 125

Arg Leu Ala Val Ile Ser Asn Phe Asp Arg Arg Leu Glu Gly Ile Leu
130 135 140

Xaa Gly Leu Gly Leu Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu
145 150 155 160

Ala Ala Gly Trp Pro Lys Pro Asp Pro Arg Ile Phe Gln Glu Ala Leu
165 170 175

Arg Leu Ala His Met Glu Pro Val Val Ala Ala His Val Gly Asp Asn
180 185 190

Tyr Leu Cys Asp Tyr Gln Gly Pro Arg Ala Val Gly Met His Ser Phe
195 200 205

Leu Val Val Gly Pro Gln Ala Leu Asp Pro Val Val Arg Asp Ser Val
210 215 220

Pro Lys Glu His Ile Leu Pro Ser Leu Ala His Leu Leu Pro Ala Leu
225 230 235 240 

Asp Cys Leu Glu Gly Ser Thr Pro Gly Leu 
245 250 



<210> 320 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 320 

Ile Arg Leu Leu Thr Trp Asp Val Lys Asp Thr Leu Leu Arg Leu Arg
1 5 10 15

His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala
20 25

<210> 321

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 321 

Leu Glu Gln Gly Phe Arg Gln Ala Tyr Arg Ala Gln Ser His Ser Phe
1 5 10 15

Pro Asn Tyr Gly Leu Ser His Gly 
20 



<210> 322 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 322 

His Leu Ala Gly Val Gln Asp Ala Gln Ala Val Ala Pro Ile Ala Glu
1 5 10 15

Gln Leu Tyr Lys Asp Phe Ser His Pro Cys
20 25 



<210> 323 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 323 

Val Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly 
1 5 10 15 

Leu Arg Leu Ala Val Ile Ser
20 



<210> 324 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 324 

Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu Ala Ala Gly Trp Pro
1 5 10 15

Lys Pro Asp Pro Arg Ile Phe Gln Glu Ala
20 25 



<210> 325 
<211> 28 
<212> PRT 






<213> Homo sapiens 
<400> 325 

Glu Pro Val Val Ala Ala His Val Gly Asp Asn Tyr Leu Cys Asp Tyr
1 5 10 15

Gln Gly Pro Arg Ala Val Gly Met His Ser Phe Leu
20 25



<210> 326 

<211> 23 

<212> PRT 

<213> Homo sapiens 



<400> 326 

Val Val Arg Asp Ser Val Pro Lys Glu His Ile Leu Pro Ser Leu Ala
1 5 10 15

His Leu Leu Pro Ala Leu Asp
20



<210> 327 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 327 

Ile Arg Lys Leu Gly Pro Gly Leu Ala Pro Cys Ser Cys Arg Ser Gly
1 5 10 15

Gln Val Phe Pro Arg Val
20



<210> 328 
<211> 241 
<212> PRT 

<213> Homo sapiens 



<400> 328 

Lys Pro Leu Arg Met Ala Arg Pro Gly Gly Pro Glu His Asn Glu Tyr
1 5 10 15

Ala Leu Val Ser Ala Trp His Ser Ser Gly Ser Tyr Leu Asp Ser Glu
20 25 30

Gly Leu Arg His Gln Asp Asp Phe Asp Val Ser Leu Leu Val Cys His
35 40 45

Cys Ala Ala Pro Phe Glu Glu Gln Gly Glu Ala Glu Arg His Val Leu
50 55 60

Arg Leu Gln Phe Phe Val Val Leu Thr Ser Gln Arg Glu Leu Phe Pro
65 70 75 80



Arg Leu Thr Ala Asp Met Arg Arg Phe Arg Lys Pro Pro Arg Leu Pro 
85 90 95 

Pro Glu Pro Glu Ala Pro Gly Ser Ser Ala Gly Ser Pro Gly Glu Ala 
100 105 110 

Ser Gly Leu Ile Leu Ala Pro Gly Pro Ala Pro Leu Phe Pro Pro Leu
115 120 125

Ala Ala Glu Val Gly Met Ala Arg Ala Arg Leu Ala Gln Leu Val Arg
130 135 140

Leu Ala Gly Gly His Cys Arg Arg Asp Thr Leu Trp Lys Arg Leu Phe
145 150 155 160

Leu Leu Glu Pro Pro Gly Pro Asp Arg Leu Arg Leu Gly Gly Arg Leu
165 170 175

Ala Leu Ala Glu Leu Glu Glu Leu Leu Glu Ala Val His Ala Lys Ser
180 185 190

Ile Gly Asp Ile Asp Pro Gln Leu Asp Cys Phe Leu Ser Met Thr Val
195 200 205

Ser Trp Tyr Gln Ser Leu Ile Lys Val Leu Leu Ser Arg Phe Pro Arg
210 215 220

Ala Val Ala Ile Ser Lys Ala Gln Thr Trp Glu Leu Ser Thr Trp Leu
225 230 235 240 

Arg 



<210> 329 
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 329 

Ala Arg Gly Thr Leu Glu Leu Pro Thr Pro Leu Ile Ala Ala His Gln
1 5 10 15

Leu Tyr Asn Tyr Val Ala Asp His Ala Ser Ser Tyr His Met
20 25 30



<210> 330 
<211> 37 
<212> PRT 

<213> Homo sapiens 



<400> 330 

Ser His Cys Glu Trp Pro Gly Gln Gly Ala Gln Asn Thr Thr Ser Met
1 5 10 15

Pro Trp Cys Arg His Gly Thr Val Leu Ala Pro Thr Trp Thr Leu Arg
20 25 30

Asp Phe Asp Thr Arg
35



<210> 331 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<400> 331

Pro Leu Thr Thr Val Ser His Leu Cys Pro Leu Ser Leu Arg Val Phe
1 5 10 15

Thr Ser His Leu Asp Ile Thr Ala Gly His Ser His Arg Asp Asp Thr
20 25 30

Trp Val Pro Ile Pro Ala Leu Pro Leu Lys His Leu Arg Pro Pro Ser
35 40 45

Ser Pro Phe Ala Leu Gly Pro Trp Val Ser His Pro Leu Met Arg Trp
50 55 60

Val Gln Lys Leu Ser His Leu His Ser Asn Pro Gly Thr Gly Phe Ser
65 70 75 80 

Met Gly Gly Lys Ser Ala Glu Lys Leu Lys Cys 
85 90 



<210> 332 

<211> 179 

<212> PRT 

<213> Homo sapiens 

<400> 332 

Ser Thr Ala Ala Arg Gly Ala Pro Gly Pro Gly Arg Ala Gly Gly Thr 
1 5 10 15 

Pro Arg Ser Ser Pro Cys Gln Ile His Trp Gly His Arg Pro Pro Ala
20 25 30

Gly Leu Leu Pro Ile His Asp Gly Leu Leu Val Pro Glu Pro Asp Gln
35 40 45

Ser Ser Pro Lys Pro Leu Pro Gln Ser Cys Arg His Phe Gln Ser Pro
50 55 60

Asp Leu Gly Thr Gln Tyr Leu Val Ala Leu Asn Gln Lys Phe Thr Asp
65 70 75 80

Cys Ser Ala Leu Val Phe Trp Thr Pro Leu Arg Lys Asp Val Ser Glu
85 90 95

Val Val Phe Arg Glu Ala Leu Pro Val Gln Pro Gln Asp Thr Arg Ser
100 105 110

Pro Pro Ala Gln Leu Val Ser Thr Tyr His His Leu Glu Ser Val Ile
115 120 125

Asn Thr Ala Cys Phe Thr Leu Leu Asp Pro Pro Pro Leu Lys Gly Val
130 135 140

Asp Trp Thr Thr Glu Cys His Cys Ser Leu Asn His Gly Pro Thr Arg
145 150 155 160

Leu Pro Ala Arg Gly Arg Thr Asp Gln Pro Phe Trp Ala Pro Gly Gln
165 170 175 

Ala Arg His 



<210> 333 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 333 

His Gln Arg Leu Cys Asn Tyr Val Leu Arg Val Cys Cys Pro Ser Leu
1 5 10 15

Ala Ala Gly Thr Ala Leu Pro Lys His Pro Gln Pro Leu Thr His Pro
20 25 30

Gly Leu Gln Arg Val Arg Ser Thr Pro Arg Thr Pro Trp Ala Leu Leu
35 40 45 

Gly Tyr Ser Phe Arg Pro Pro Trp 
50 55 



<210> 334 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 334 

Pro Gly Gly Pro Glu His Asn Glu Tyr Ala Leu Val Ser Ala Trp His 
1 5 10 15

Ser Ser Gly Ser Tyr Leu Asp Ser Glu Gly Leu Arg
20 25

<210> 335
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Asp Val Ser Leu Leu Val Cys His Cys Ala Ala Pro Phe Glu Glu Gln
1 5 10 15

Gly Glu Ala Glu Arg His Val Leu Arg
20 25



<210> 336 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Arg Leu Thr Ala Asp Met Arg Arg Phe Arg Lys Pro Pro Arg Leu Pro
1 5 10 15

Pro Glu Pro Glu Ala Pro Gly Ser Ser Ala Gly Ser
20 25



<210> 337 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 337 

Gly Glu Ala Ser Gly Leu Ile Leu Ala Pro Gly Pro Ala Pro Leu Phe
1 5 10 15

Pro Pro Leu Ala Ala Glu Val Gly Met 
20 25 



<210> 338 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Thr Leu Trp Lys Arg Leu Phe Leu Leu Glu Pro Pro Gly Pro Asp Arg
1 5 10 15

Leu Arg Leu Gly Gly Arg Leu
20



<210> 339 
<211> 28 
<212> PRT 
<213> Homo sapiens
<400> 339

Leu Ala Glu Leu Glu Glu Leu Leu Glu Ala Val His Ala Lys Ser Ile
1 5 10 15

Gly Asp Ile Asp Pro Gln Leu Asp Cys Phe Leu Ser
20 25 



<210> 340 
<211> 197 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (97)

<223> Xaa equals any of the naturally occurring L-amino acids

<400> 340 

Phe Gln Leu Tyr Phe Asn Pro Glu Leu Ile Phe Lys His Phe Gln Ile
1 5 10 15

Trp Arg Leu Ile Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn
20 25 30

Phe Leu Phe Asn Met Ile Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu
35 40 45

Glu Gly Ser Phe Arg Gly Arg Thr Ala Asp Phe Val Phe Met Phe Leu
50 55 60

Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val Ser Leu Val
65 70 75 80

Phe Leu Gly Gln Ala Phe Thr Ile Met Leu Val Tyr Val Trp Ser Arg
85 90 95

Xaa Asn Pro Tyr Val Arg Met Asn Phe Phe Gly Leu Leu Asn Phe Gln
100 105 110

Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu Leu Leu Gly
115 120 125

Asn Ser Ile Ile Val Asp Leu Leu Gly Ile Ala Val Gly His Ile Tyr
130 135 140

Phe Phe Leu Glu Asp Val Phe Pro Asn Gln Pro Gly Gly Ile Arg Ile
145 150 155 160

Leu Lys Thr Pro Ser Ile Leu Lys Ala Ile Phe Asp Thr Pro Asp Glu
165 170 175

Asp Pro Asn Tyr Asn Pro Leu Pro Glu Glu Arg Pro Gly Gly Phe Ala
180 185 190

Trp Gly Glu Gly Gln
195



<210> 341 
<211> 108 
<212> PRT 
<213> Homo sapiens 



<400> 341 

Gly Val Gly Gln Ala Thr Val Gly Lys Met Ala Tyr Gln Ser Leu Arg
1 5 10 15

Leu Glu Tyr Leu Gln Ile Pro Pro Val Ser Arg Ala Tyr Thr Thr Ala
20 25 30

Cys Val Leu Thr Thr Ala Ala Val Gln Leu Glu Leu Ile Thr Pro Phe
35 40 45

Gln Leu Tyr Phe Asn Pro Glu Leu Ile Phe Lys His Phe Gln Ile Trp
50 55 60

Arg Leu Ile Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn Phe
65 70 75 80

Leu Phe Asn Met Ile Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu Glu
85 90 95

Gly Ser Phe Arg Gly Arg Thr Ala Asp Phe Val Phe
100 105



<210> 342 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 342 

Leu Ile Phe Lys His Phe Gln Ile Trp Arg Leu Ile Thr Asn Phe Leu
1 5 10 15

Phe Phe Gly Pro Val Gly Phe 
20 



<210> 343 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 343 

Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu Glu Gly Ser Phe Arg Gly 
1 5 10 15

Arg Thr Ala Asp Phe Val Phe Met Phe
20 25 



<210> 344 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 344

Leu Val Phe Leu Gly Gln Ala Phe Thr Ile Met Leu Val Tyr Val Trp
1 5 10 15 

Ser Arg Xaa Asn Pro Tyr Val 
20 



<210> 345 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 345

Val Leu Met Gly Phe Ser Leu Leu Leu Gly Asn Ser Ile Ile Val Asp
1 5 10 15

Leu Leu Gly Ile Ala
20 



<210> 346 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 346 

Asn Gln Pro Gly Gly Ile Arg Ile Leu Lys Thr Pro Ser Ile Leu Lys
1 5 10 15

Ala Ile Phe Asp Thr Pro Asp Glu Asp
20 25 



<210> 347 

<211> 28 

<212> PRT 

<213> Homo sapiens 



<400> 347 

Arg Leu Glu Tyr Leu Gln Ile Pro Pro Val Ser Arg Ala Tyr Thr Thr
1 5 10 15

Ala Cys Val Leu Thr Thr Ala Ala Val Gln Leu Glu
20 25 



<210> 348 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Arg Leu Ile Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn Phe
1 5 10 15

Leu Phe Asn Met Ile Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu
20 25 30 



<210> 349 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 349 

His Ala Ser Ala Gly Pro Asp Gly Ser Ser Pro Ala 
1 5 10 



<210> 350 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Glu Leu Leu Leu Glu Lys Pro Lys Pro Trp Gln Pro Pro Ala Ala Ala
1 5 10 15

Pro His Arg Ala Leu Leu Val Leu Cys Tyr Ser Ile Val Glu Asn Thr
20 25 30

Cys Ile Ile Thr Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu
35 40 45

Ile Leu Gly Phe Gly Lys Ser Val Cys Asp Ser Leu Gly Arg Arg His
50 55 60

Met Ser Thr Cys Ala Leu Cys Asp Phe Cys Ser Leu Lys Leu Glu Gln
65 70 75 80

Cys His Ser Glu Ala Ser Leu Gln Arg Gln Gln Cys Asp Thr Ser His
85 90 95

Lys Thr Pro Phe Ala Ala Pro Cys Leu Pro Pro Arg Ala Cys Pro Ser
100 105 110



Ala Thr Arg 
115 



<210> 351 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<400> 351 

Leu Pro Gly Trp Gly Phe Pro Thr Lys Ile Cys Asp Thr Asp Tyr Ile
1 5 10 15

Gln Tyr Pro Asn Tyr Cys Ser Phe Lys Ser Gln Gln Cys Leu Met Arg
20 25 30

Asn Arg Asn Arg Lys Val Ser Arg Met Arg Cys Leu Gln Asn Glu Thr
35 40 45

Tyr Ser Ala Leu Ser Pro Gly Lys Ser Glu Asp Val Val Leu Arg Trp
50 55 60

Ser Gln Glu Phe Ser Thr Leu Thr Leu Gly Gln Phe Gly
65 70 75 



<210> 352 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<400> 352 

Ser Pro Val Leu Leu Pro Ala Phe Pro Pro Leu Pro Val Pro Leu Leu
1 5 10 15

Ala Leu Pro Val Ser Ala Pro Leu Pro Ala Cys Val Leu Val Ser Ala
20 25 30

Pro Ala Cys Ala Pro Leu Leu Ala Pro Ala Cys Ala Leu Ala Leu Ala
35 40 45

Pro Gly Phe Pro Gly Thr Arg Arg Ile Val Gly Ala Leu Pro Arg Cys
50 55 60

Cys
65



<210> 353 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 353

Leu Leu Val Leu Cys Tyr Ser Ile Val Glu Asn Thr Cys Ile Ile Thr
1 5 10 15

Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu Ile Leu Gly Phe
20 25 30

Gly Lys Ser
35



<210> 354 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 354 

Leu Lys Leu Glu Gln Cys His Ser Glu Ala Ser Leu Gln Arg Gln Gln
1 5 10 15

Cys Asp Thr Ser His Lys Thr Pro Phe Ala
20 25



<210> 355 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 355 

Gln Val Ser Gly Leu Ile Leu Ser Leu Ser Cys Gly Met Asp Gly Leu
1 5 10 15

Ala Leu Asp Gly Ser Pro Ser Pro Ser Pro Xaa Thr Glu Lys Ala Gly
20 25 30

Arg Cys Ile Ser Gln Thr Ser Leu
35 40 



<210> 356 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 356

Gln Val Ser Gly Leu Ile Leu Ser Leu Ser Cys Gly Met Asp Gly Leu
1 5 10 15

Ala Leu Asp Gly Ser Pro Ser Pro Ser Pro Xaa Thr Glu Lys Ala Gly
20 25 30

Arg Cys Ile Ser Gln Thr Ser Leu Pro Gly Lys Trp Glu Val
35 40 45



<210> 357 
<211> 173 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (118)

<223> Xaa equals any of the naturally occurring L-amino acids

<400> 357

Arg Ala Ser Lys Thr Val Pro Arg Met Pro Pro Asn Trp Pro Ala Lys 
1 5 10 15

Met Pro Cys Leu Cys His Ile Arg Thr Val Glu His Leu Gly Thr Ile
20 25 30

Ser Ser Gly Ala Pro Gly Arg Pro Thr Gly Gln Gln Ala Ala Arg Thr
35 40 45

Tyr His Ile Cys Trp Ile His Pro Gly Gln Lys Ile Asp Ser Leu Pro
50 55 60

Pro Ser Ser Gln His Pro Arg Ser Gln Gln Leu Ala Pro Gly Thr Trp
65 70 75 80

Pro Ser Thr Ser Thr Thr Lys Pro Ala Glu Glu Thr Leu Gly Ser Ser
85 90 95

Ala Ser Leu Pro Ile Ser Gln Ala Arg Lys Ser Glu Lys Cys Thr Phe
100 105 110

Gln Pro Ser Pro Trp Xaa Val Arg Gly Lys Glu Ser His Gln Val Pro
115 120 125

Ala His Pro Ser His Arg Thr Glu Thr Glu Ser Asp His Ser Pro Val
130 135 140

Arg Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly
145 150 155 160

Asp Trp Ser Glu Ala Trp Leu Leu Glu Leu Ala Leu Leu 
165 170 






<210> 358 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 358 

Arg Met Pro Pro Asn Trp Pro Ala Lys Met Pro Cys Leu Cys His Ile
1 5 10 15 

Arg Thr Val Glu His Leu Gly 
20 



<210> 359 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 359 

Gly Arg Pro Thr Gly Gln Gln Ala Ala Arg Thr Tyr His Ile Cys Trp
1 5 10 15

Ile His Pro Gly Gln Lys Ile Asp Ser
20 25 



<210> 360 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 360 

Trp Pro Ser Thr Ser Thr Thr Lys Pro Ala Glu Glu Thr Leu Gly Ser 
1 5 10 15

Ser Ala Ser Leu Pro Ile Ser Gln Ala
20 25 



<210> 361 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 361 

Lys Ser Glu Lys Cys Thr Phe Gin Pro Ser Pro Trp Xaa Val Arg Gly 
1 5 10 15 



Lys Glu Ser His Gln Val Pro
20



<210> 362 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 362 

Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly Asp 
1 5 10 15

Trp Ser Glu Ala Trp Leu Leu Glu 
20 



<210> 363 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Pro Cys Ala Asp Cys Leu Ser Ala Trp Ala 
1 5 10



<210> 364 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 364 

His Ala Ser Gly Tyr Leu Cys Ile Val Leu Leu
1 5 10



<210> 365 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 365 

Asn Ser Ala Arg Ala Ala Arg Ala Glu Ile Val Leu Gly Leu Leu Val
1 5 10 15

Trp Thr Leu Ile Ala Gly Thr Glu Tyr Phe Arg Val Pro Ala Phe Gly
20 25 30 

Trp Val 



<210> 366 
<211> 22 
<212> PRT 
<213> Homo sapiens
<400> 366 

Pro Cys Ser Pro Pro Asp Ser Pro Pro Leu Pro Gly Ala Phe Val Trp 
1 5 10 15 

Arg Val Leu Trp Val Cys 
20 



<210> 367 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 367 

Ala Arg Ala Cys Phe Ala Tyr Asn Gly Val Cys Ser Glu Gly Arg Cys
1 5 10 15

Trp Asp Ser His Phe His Gly Ser Val
20 25



<210> 368 
<211> 100 
<212> PRT 

<213> Homo sapiens 



<400> 368 

Met Ser Asn Met Gly Lys Ile Pro Ser Leu Ser Leu His Ile Pro Ile
1 5 10 15

Asn Lys Tyr Ile Cys Ser Arg Ile Pro Lys Phe Ile Gln Lys Val Asn
20 25 30

Lys Ser Thr Val Leu Gln Ile Cys Leu Lys Arg Gln Ile Ile Leu Asn
35 40 45

Lys Asn Lys Met Ser Asp His Ser Lys Ile Gly Lys Ala Asn Leu Val
50 55 60

Gln Ile Asp Ile His Ser Leu Gly Ile Val Glu Thr Gly Cys Val Pro
65 70 75 80

Ser Lys Arg Tyr Cys Thr Leu Leu Thr Glu Gln Ser Gly Phe Pro Phe
85 90 95

Leu Ser His Pro
100



<210> 369 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<220>

<221> SITE 
<222> (54) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (82) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 369 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe 
1 5 10 15

Leu Cys Gly Leu Ile Ser Ile Glu Arg Val Ile Cys Asn Pro Val Ser
20 25 30

Ala Asp Phe Gln Val Ser Thr Phe Cys Gln Arg His Cys Leu Leu Arg
35 40 45

Ser Lys Val Met Phe Xaa Ile Lys Gly Xaa Thr Ala Thr Ile Glu Val
50 55 60

Ile Asn Glu Asn Cys Thr Leu Val Ala Ala Pro Pro Ile Gly Phe Pro
65 70 75 80

Ile Xaa Phe Leu



<210> 370 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<400> 370 

Met Ser Asp His Ser Lys Ile Gly Lys Ala Asn Leu Val Gln Ile Asp
1 5 10 15

Ile His Ser Leu Gly Ile Val Glu Thr Gly Cys Val Pro Ser Lys Arg
20 25 30

Tyr Cys Thr Leu Leu Thr Glu Gln Ser Gly Phe Pro Phe Leu Ser His
35 40 45

Pro



<210> 371
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 371 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe
1 5 10 15

Leu Cys Gly Leu Ile Ser Ile Glu Arg Val Ile Cys Asn Pro Val Ser
20 25 30

Ala Asp Phe Gln Val Ser Thr Phe Cys Gln Arg His Cys Leu Leu Arg
35 40 45

Ser Lys
50



<210> 372 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (4) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (8) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (32) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 372 

Val Met Phe Xaa Ile Lys Gly Xaa Thr Ala Thr Ile Glu Val Ile Asn
1 5 10 15

Glu Asn Cys Thr Leu Val Ala Ala Pro Pro Ile Gly Phe Pro Ile Xaa
20 25 30 

Phe Leu 



<210> 373 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<400> 373

Pro Thr Glu Gly Arg Gln Lys Val Leu Lys Thr Phe Thr Val Pro Arg
1 5 10 15

Ser Ala Leu Ala Met Thr Lys Thr Ser Thr Cys Ile Tyr His Phe Leu
20 25 30

Val Leu Ser Trp Tyr Thr Phe Leu Asn Tyr Tyr Ile Ser Gln Glu Gly
35 40 45

Lys Asp Glu Val Lys Pro Lys Ile Leu Ala Asn Gly Ala Arg Trp Lys
50 55 60

Tyr
65



<210> 374 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 374 

Pro Arg Ser Ala Leu Ala Met Thr Lys Thr Ser Thr Cys Ile Tyr His
1 5 10 15

Phe Leu Val Leu Ser Trp Tyr Thr Phe Leu Asn Tyr Tyr Ile Ser Gln
20 25 30

Glu Gly Lys
35



<210> 375 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 375 

Pro Thr Glu Gly Arg Gln Lys Val Leu Lys Thr Phe Thr Val Pro Arg
1 5 10 15

Ser Ala Leu Ala Met Thr Lys Thr
20



<210> 376 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 376 

Phe Leu Asn Tyr Tyr Ile Ser Gln Glu Gly Lys Asp Glu Val Lys Pro
1 5 10 15

Lys Ile Leu Ala Asn Gly Ala Arg Trp Lys Tyr
20 25 



<210> 377 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 377 

Phe Lys Asp Gln Leu Val Tyr Pro Leu Leu Ala Phe Thr
1 5 10



<210> 378 
<211> 13 
<212> PRT 

<213> Homo sapiens
<400> 378 

Arg Gin Ala Leu Asn Leu Pro Asp Val Phe Gly Leu Val 
1 5 10 



<210> 379 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Thr Ala Ser His Asp Leu Leu Leu Phe 
1 5 10



<210> 380 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 380 

Met Ser Ile Asn Ile Cys Leu Met Gln Ser Lys Thr Gln Gly Ser Cys
1 5 10 15

Gln Tyr Leu Leu Leu Pro His Pro Val Pro Ile Ile Leu Lys Val Ser
20 25 30

Thr Val Phe Ser Leu Leu Ser Leu Phe Arg Leu Leu Phe Leu Ser Phe
35 40 45

Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu Lys Tyr Tyr Gly Pro
50 55 60

Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu Gly Val
65 70 75 80

Ile Gln Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn Gly Ile
85 90 95 



Gly 



<210> 381 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<400> 381 

Met Ser Ile Asn Ile Cys Leu Met Gln Ser Lys Thr Gln Gly Ser Cys
1 5 10 15

Gln Tyr Leu Leu Leu Pro His Pro Val Pro Ile Ile Leu Lys Val Ser
20 25 30

Thr Val Phe Ser Leu Leu Ser Leu Phe Arg Leu Leu Phe Leu
35 40 45



<210> 382 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 382 

Ser Phe Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu Lys Tyr Tyr 
1 5 10 15

Gly Pro Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu
20 25 30

Gly Val Ile Gln Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn
35 40 45

Gly Ile Gly
50 



<210> 383 
<211> 23 
<212> PRT 
<213> Homo sapiens
<400> 383 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
1 5 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 



<210> 384 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 384 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
1 5 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 

<210> 385 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 385 

Leu Gln Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu
1 5 10 15

Leu Gly Gln



<210> 386 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 386 

Leu Gln Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu
1 5 10 15

Leu Gly Gln



<210> 387 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 387 

Arg Gln Ala Val Ile Val Cys Arg Arg Arg Phe Val
1 5 10 



<210> 388 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<400> 388 

Pro Pro Arg Trp Ala His Pro Lys Ala Pro Glu Gly Ser Pro Asp Pro 
1 5 10 15

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp
20 25 30

Ser Asp Pro Trp His Ile Ser Val Ser Pro Cys Ala Gln Arg Glu His
35 40 45

Tyr Ser Pro Gly Ser Ala His Ile Asn Ser Leu Arg Pro Leu Pro Ala
50 55 60

Leu Ser Leu Lys Arg Cys Lys Ala Arg Val Ser Ser Ser Cys Leu Tyr
65 70 75 80

Pro Ala Pro Ala Pro Ala Pro Ala Pro Leu Glu Ile Asp Arg Cys Asp
85 90 95

Ser Val Pro Pro Val Ala Leu Cys Ser Ala Ala Tyr Thr Leu Arg Ile
100 105 110

Cys Trp Ala Ser Val Leu Cys His Arg Pro Pro Pro Ser Thr Ser Gln
115 120 125

Pro Lys Pro Arg Ala Arg Pro Lys Lys Gly Lys Ala Ile Phe Pro Thr
130 135 140

Ala Gln Val Pro
145 



<210> 389 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<400> 389 

Pro Pro Arg Trp Ala His Pro Lys Ala Pro Glu Gly Ser Pro Asp Pro 
1 5 10 15 

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp 
20 25 30 

Ser Asp Pro Trp His Ile Ser Val Ser Pro Cys Ala Gln Arg Glu His
35 40 45

Tyr Ser Pro Gly Ser Ala His Ile Asn Ser Leu Arg Pro Leu Pro Ala
50 55 60 

Leu Ser Leu Lys Arg Cys Lys 
65 70 



<210> 390 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Ala Arg Val Ser Ser Ser Cys Leu Tyr Pro Ala Pro Ala Pro Ala Pro
1 5 10 15

Ala Pro Leu Glu Ile Asp Arg Cys Asp Ser Val Pro Pro Val Ala Leu
20 25 30

Cys Ser Ala Ala Tyr Thr Leu Arg Ile Cys Trp Ala Ser Val Leu Cys
35 40 45

His Arg Pro Pro Pro Ser Thr Ser Gln Pro Lys Pro Arg Ala Arg Pro
50 55 60

Lys Lys Gly Lys Ala Ile Phe Pro Thr Ala Gln Val Pro
65 70 75



<210> 391 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Glu Glu Lys Leu Phe Thr Ser Ala Pro Gly Arg Asp Phe Trp Val Met 
1 5 10 15

Gly Glu Thr Arg Asp Gly Asn Glu Glu Asn 
20 25 



<210> 392 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 392 

Gln Lys Pro Thr Phe Ala Leu Gly Glu Leu Tyr Pro Pro Leu Ile Asn
1 5 10 15

Leu Trp Glu Ala Gly Lys Glu Lys Ser Thr Ser Leu Lys Val Lys Ala
20 25 30

Thr Val Ile Gly Leu Pro Thr Asn Met Ser
35 40



INTERNATIONAL SEARCH REPORT 



International application No.
PCT/US99/05804



A. CLASSIFICATION OF SUBJECT MATTER

IPC(6): Please See Extra Sheet

US CL: Please See Extra Sheet
According to International Patent Classification (IPC) or to both national classification and IPC


B. FIELDS SEARCHED


Minimum documentation searched (classification system followed by classification symbols)
U.S. : 536/23.5; 435/320.1, 252.3, 69.1, 6, 7.1; 530/300, 388.22; 514/2; 436/501


Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched


Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
Please See Extra Sheet


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


Database EST; Accession No.:AA331279. ADAMS et al. Initial 
assessment of human gene diversity and expression patterns based 
upon 83 million nucleotides of cDNA sequence. Nature 377 (6547 
Suppl), 3-174 (1995), Nucleotides 1-314. 


1-12, 14-21 


A 


SCHWARTING, R. et al. Biochemical characterization and 
purification of human B Cell Stimulatory Factor (BSF). Eur. J. 
Immunol. 1985, Vol. 15, pages 632-637, see entire document. 


1-21 


□ Further documents are listed in the continuation of Box C. □ See patent family annex.




* Special categories of cited documents:

"A" document defining the general state of the art which is not considered
to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is
cited to establish the publication date of another citation or other
special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other
means

"P" document published prior to the international filing date but later than
the priority date claimed


"T" later document published after the international filing date or priority
date and not in conflict with the application but cited to understand
the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be
considered novel or cannot be considered to involve an inventive step
when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be
considered to involve an inventive step when the document is
combined with one or more other such documents, such combination
being obvious to a person skilled in the art

"&" document member of the same patent family


Date of the actual completion of the international search 
07 JUNE 1999 


Date of mailing of the international search report 

07 JUL 1999 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231
Facsimile No. (703) 305-3230 


Authorized officer 

EILEEN CHARA
Telephone No. (703) 308-0196 



Form PCT/ISA/210 (second sheet)(July 1992)*



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/05804 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:
because they relate to subject matter not required to be searched by this Authority, namely:



2. □ Claims Nos.:
because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:



3. □ Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).
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This International Searching Authority found multiple inventions in this international application, as follows: 
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I. (—j As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 
claims. 



2. □ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. □ As only some of the required additional search fees were timely paid by the applicant, this international search report covers 
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536/23.5; 435/320.1, 252.3, 69.1, 6, 7.1; 530/300, 388.22; 514/2; 436/501 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

SEQUENCE DATA BASE MPSRCH: EST, GenEmbl, N_Geneseq_34, Issued_Patents_NA, SPTREMBL8, 
SwissProt 36, PIR_58, Issued_Patents_AA (SEQ ID NOS: 11 and 108 only). One nucleotide sequence and one amino 
acid sequence have been searched. It is not clear which sequences are embraced by the claims because the claims refer 
to sequences X and Y. The table at pages 180-188 contains many sequences X and Y, yet the claims refer to X and Y 
in the singular only. Accordingly, the first X nucleotide sequence disclosed and the first Y amino acid sequence 
disclosed were searched. 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

Group I, claims 1-21, drawn to nucleic acid molecules, vectors, host cells containing recombinant nucleic acid molecules, 
polypeptides, antibodies, a method of producing a polypeptide, a method for treating a medical condition comprising 
administering a polypeptide, a method of diagnosing a pathological condition by genetic analysis or protein assay, and a 
method for identifying a binding partner to a polypeptide, for gene 1, the nucleic acid molecule identified by SEQ ID 
NO: 11, and the polypeptide identified by SEQ ID NO: 108, as listed in the table on pages 180-188 of the description. 
Groups II through XCV, claims 1-21 for each group, drawn to nucleic acid molecules, vectors, host cells containing 
recombinant nucleic acid molecules, polypeptides, antibodies, a method of producing a polypeptide, a method for treating 
a medical condition comprising administering a polypeptide, a method of diagnosing a pathological condition by genetic 
analysis or protein assay, and a method for identifying a binding partner to a polypeptide, for genes 2 through 95, the 
nucleic acid molecules identified by SEQ ID NOS: 12 through 105, and the polypeptides identified by SEQ ID NOS: 109 
through 202, respectively, as listed in the table on pages 180-188 of the description. 



The inventions listed as Groups I through XCV do not relate to a single inventive concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

Pursuant to 37 C.F.R. § 1.475(b-d), the ISA/US considers that where multiple products and processes are claimed, the 
main invention shall consist of the first invention of the category first mentioned in the claims and the first recited 
invention of each of the other categories related thereto. Accordingly, the main invention (Group I) comprises the first 
recited product, a polynucleotide comprising gene No. 1, identified by SEQ ID NO: 11, the polypeptide it encodes, 
identified by SEQ ID NO: 108, methods of producing the polypeptide, and methods of using the polynucleotide and 
polypeptide. Further pursuant to 37 C.F.R. § 1.475(b-d), the ISA/US considers that any feature which the subsequently 
recited products and methods share with the main invention does not constitute a special technical feature within the 
meaning of PCT Rule 13.2 and that each of such products and methods accordingly defines a separate invention. 
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